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ABSTRACT 


In  recent  years,  there  has  been  a  considerable  increase  in  both  the 
variety  and  number  of  sensors  which  needed  to  be  tied  together.  A  new 
distributed  estimation  architecture  for  Distributed  Sensors  Networks  (DSN)  is 
introduced.  It  is  called  Horizontal  Estimation  Architecture  (HEA).  The  term 
horizontal  is  used  to  imply  that  the  geographically  dispersed  nodes  do  not 
differ  in  rank  and  are  peer-to-peer  coupled.  Each  node  is  connected  by  a 
data  link  to  its  neighbors  (where  possible),  thus  providing  a  mesh  network 
topology.  The  introduced  HEA  has  four  major  components,  the  local 
estimator,  the  information  fusion  process  (both  together  are  called  a 
horizontal  estimator),  the  network  access  protocol,  and  the  controller  - 
decisionmaker. 

The  HEA  techniques  are  applied  to  the  solution  of  Multitarget  and 
Multisensor  Tracking  (MMT)  problems  in  Track-While-Scan  (TWS)  systems 
with  an  emphasis  towards  track  fusion.  A  mathematical  framework  which 
encompasses  the  components  of  the  horizontal  estimator  is  developed,  with 
an  emphasis  towards  the  track  fusion  algorithm.  An  artificial  intelligence 
approach  using  expert  systems  for  track  fusion  has  been  presented.  Through 
this  HEA  application  its  main  features  and  practical  usefulness  are 
addressed. 
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I.  INTRODUCTION 


In  recent  years,  there  has  been  rn  increasing  interest  in  Distributed 
Sensors  Networks  (DSN).  Examples  can  be  found  in  Air  Traffic  Control  ( ATC) 
systems,  surveillance  systems,  and  air  defence  systems.  In  these  systems, 
computers  are  sited  essentially  at  the  sensor  sites,  or  in  the  display  system, 
and  in  the  command  and  control  areas.  As  a  consequence,  a  distributed 
sensors  network  implies  a  computer  network,  which  ensures  performance  of 
data  processing,  organization  of  information  display,  and  the  communication 
between  the  different  network  components,  in  addition  to  an  estimation  and 
decision  making  processes. 


A.  MAIN  DSN  DEVELOPMENT  ISSUES 

There  are  many  different  issues  which  arise  in  the  development  of  a 
DSN.  The  fundamental  issues  are  : 

1.  The  design  of  multisensor  architectures. 

2.  How  much  local  processing  capability  should  a  sensor  have,  and  how 
should  the  data  communicated  by  a  sensor  be  summarized  and 
compressed? 

3.  The  procedures  for  data  fusion. 

4.  The  performance  evaluation  of  a  multisensor  configuration. 


5. 

6. 


The  on-line  management/control  of  an  implemented  muitisensors 
oetwork. 

Provision  of  software  and  hardware,  bearing  in  mind  the  growing, 
restructuring  and  reconfiguration  capabilities. 
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These  issues  are  difficult.  They  become  more  so  in  the  multitarget 
environment  faced  in  aircraft,  missile,  battlefield,  and  ocean  surveillance.  In 
these  environments,  there  is  the  added  problem  of  data  association.  That  is. 
given  a  piece  of  data,  one  must  determine  to  what  it  should  be  attributed. 
Should  it  be  one  of  the  current  objects  being  considered,  and  if  so,  which?  or. 
should  it  be  a  new  object,  or  a  false  alarm? 

General  architectures  of  multi  sensor  networks  are  centralized  and 
distributed.  Several  problems  arise  in  the  analysis  and  design  of  the 
mentioned  architectures,  among  them  are: 

a.  Air-Space  management,  i.e.  allotment  of  airspace  sectors  to  the 
sensors  of  the  network. 

b.  Gathering,  routing,  management  and  dissemination  of  data  and 
results  through  the  communication  network. 

c.  Organization  of  sensors  and  processors. 

B.  REQUIREMENTS  FOR  DEVELOPMENT  OF  DSN 

There  are  technological  advances  in  several  disciplines,  which  are 
providing  tools  for  designers  of  DSN.  An  adequate  development  of  DSN 
requires  the  successful  integration  of  these  disciplines,  e.g,  modern  theory  of 
systems  and  control,  computer  science,  communications,  man- machine 
interactions,  expert  systems,  information  systems,  reliability  and 
maintainability.  In  defense  systems  especially,  there  is  an  increasing 
interest  in  simultaneously  using  various  types  of  sensing  techniques  co¬ 
operating  with  each  other,  such  as  various  types  of  radars,  infrared  detecting 
passive  radio  surveillance,  and  laser  systems.  The  combination  gives  rise  to 
a  system  concept  which  requires  location  of  objects  and  detection  over  a 
wide  frequency  spectrum.  This  decreases  the  effect  of  unintentional  and 
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intentional  interferences,  and  increases  the  system  reliability  and 
survivability.  In  addition,  the  use  of  passive  sensors  improves  the  system's 
covertness.  Also,  a  defense  system  is  required  to  remain  reliable  and  secure 
in  the  presence  of  intense  hostile  activity,  winch  will  probably  include 
destructive  actions. 

Today,  the  communication  is  both  slower  and  more  costly  than 
computation  and  processing.  Present  trends  indicate  that  these  imbalances 
in  speed  and  costs  of  communications  and  processing  will  not  only  continue 
but  are  likely  to  greatly  increase  (Ref.  1  ].  Military  communication  systems 
are  generally  designed  and  tested  in  a  peacetime  setting.  However,  their 
greatest  test  will  come  in  a  conflict  situation,  when  the  need  to  communicate 
becomes  great  and  hostile  enemy  actions,  including  physical  destruction  and 
electronic  countermeasures  will  create  node  and  link  failures  and  a 
dynamically  changing  network  topology.  So,  one  of  the  prime  motivations  in 
DSN  is  to  try  to  minimize  the  communication  load  between  different  sites  or 
nodes. 

Since  the  sensors  are  the  means  by  which  a  decision  making  process 
observes  the  environment ,  which  is  generally  a  dynamically  changing 
environment  in  the  presence  of  uncertainties,  sensor  measurements  are  used 
to  reduce  these  uncertainties,  and  determine  the  current  state  of  the  system. 
So,  DSN  implicitly  includes  an  estimation  process  in  addition  to  a  decision 
making  process.  The  future  trends  of  decision  making  in  complex 
technological  environments  are  towards  decentralized  strategies.  These 
trends  are  being  supported  by  the  fruitful  progress  in  the  direction  of 
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distributed  computer  architectures,  distributed  data  processing,  distributed 
knowledge  based  systems  and  fifth  generation  computers. 

The  Japanese  Fifth  Generation  Computer  Systems  (FGCS)  project  is 
directed  toward  nonnumeric  applications.  The  more  traditional  systems 
application  of  number  crunching  is  not  an  issue  in  the  FGCS.  It  is  expected 
that  the  supercomputers  being  developed  today  will  perform  these  tasks. 

The  FGCS  project  is  directed  instead  toward  applied  artificial  intelligence  and 
symbol  manipulation.  In  other  words,  the  next  generation  of  computers  is 
being  designed  to  manipulate  knowledge.  It  will  employ  and  exploit 
knowledge-based  system  technology. 

C.  DISTRIBUTED  ESTIMATION  AECHITECTURES 

A  distributed  estimation  architecture  for  DSN  is  introduced  bearing  in 
mind  previously  mentioned  considerations.  Research  in  distributed 
estimation  has  progressed  along  several  directions.  A  survey  of  these 
directions  can  be  found  in  the  paper  by  Chong,  Tse  and  Mori  (Ref.  2).  and  its 
references  (Ref.  3.4.5.61.  The  estimation  architectures  used  are  mainly 
hierarchical  and  centralized.  In  our  approach,  the  Horizontal  Estimation 
Architecture  (  HEA  )  is  introduced.  The  term  horizontal  is  used  to  imply  that 
the  geographically  dispersed  nodes  do  not  differ  in  rank.  They  are  of  equal 
status  and  peer-  to  -  peer  coupled,  and  emphasizing  the  nonhierarchicai 
architecture  used. 

The  practical  usefulness  and  the  performance  of  HEA  techiques  are  better 
described  with  reference  to  the  particular  applications  for  which  they  are 
designed.  Multitarget  Multisensor  Tracking  (MMT)  problems  in  Track  - 
While-Scan  (TWS)  systems,  has  been  selected  as  a  specific  problem  to  be 


16 


addressed,  emphasising  the  information  (track)  fusion  process.  A 
mathematical  model  for  MMT  is  presented.  An  algorithm  for  pairvise  track 
fusion  is  derived  and  the  optimality  of  HEA  track  fusion  is  justified.  An 
artificial  intelligence  approach  using  expert  systems  for  track  fusion  has 
been  presented.  Through  the  HEA  application  its  main  features  and  practical 


usefulness  are  addressed. 


CENTRAL 

ESTIMATOR 


PROCESS  ENVIRONMENT  WITH  STATE  X 


FIGURE  1.1.  Centralized  Estimation  Architecture 
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FIGURE  1.2.  Hierarchical  Estimation  Architecture 
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II.  HORIZONTAL  ESTIMATION  ARCHITECTURE 


A.  MAIN  FEATURES  OF  HEA 

The  main  concept  of  distributed  estimation  applied  to  DSN  is  used. 
Specifically,  each  node  in  the  network  performs  local  processing  of  the  data 
collected  only  by  its  own  sensor  or  sensors.  The  locally  processed  data  from 
each  node  are  communicated  through  an  appropriate  data  link  to  its 
neighboring  nodes  (where  possible),  in  addition  to  using  it  locally.  Thus 
providing  a  mesh  network  topology  (  Figure  2.1 ). 


q  a  Network  Node 

-  •  Communication  Link 

FIGURE  2.1.  Mesh  Network  Topology 
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B.  GRAPH  REPRESENTATION  OF  A  MESH  NETWORK 

Using  graph  theory  with  some  modifications,  we  represent  the  mesh 
network  as  a  directed  graph  G  (Ref.  7:pp.  -424-4731. 

Definition:  Let  V  be  a  finite  set.  A  mesh  network  can  be  represented  as  a 
directed  graph  (or  digraph)  G  on  V,  made  up  of  the  elements  of  V,  called  the 
nodes  of  G,  and  a  subset  E  of  the  cross  product  V  x  V.  called  the 
communication  links  of  G.  If  a.b  €  V  and  (a.b)  €  E.  then  there  is  a 
communication  link  from  a  to  b.  Node  a  is  called  source  of  the  link,  with  o 
the  terminus,  or  terminating  node,  and  we  say  that  b  is  neighbor  to  a,  and  a 
is  neighbor  to  b.  It  is  asumed  that  G  is  loop  free,  and  there  are  no  isolated 
nodes.  Generally  each  node  is  a  source  and  terminus. 

Based  on  that  definition,  let  V  be  the  set  of  nodes,  while  E  is  the  set  of 
communication  links,  and  it  is  a  subset  of  the  cross  product  of  the  sets  V  and 


V,  i.e 

EC(VxV)  (2.1) 

where  V  x  V  *  ( (  a,b.c . 41)  I  a,b,c . 41  €  V  )  (2.2) 

and  we  write  the  mesh  network  G  as 

G-(V.E)  (2.3) 


where  G.  usually  is  not  fully  connected,  i.e.  for  all  i.y  e  V.  1  y.  there  is  not 
necessary  a  link  from  1  to  y.  In  mesh  networks,  a  link  is  often  directed  in 
both  directions  (bidirectional).  Consequently,  if  G  is  a  directed  graph  and  a.b 
€  V.  a  f  b.  with  both  (a,b) ,  (b.a)  e  £  .  the  undirected  link  in  Figure  2.2(b),  are 
used  to  represent  the  bidirectional  link  shown  in  Figure  2.2(a).  In  this  case, 
a  and  b  are  called  neighboring  nodes,  and  it  is  represented  as 
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(la,b))  -  ((a,b),(b,a)}.  In  HEA  tbe  data  processing  capacity  can  vary  from  node 
to  node,  and  communication  capacity  can  vary  from  link  to  link. 


Figure  2.2.  Representation  of  a  Bidirectional  Link 


e 


v  «  (  a,b,c,d,e,f  ] 

E  *  ( [a,D],  [a,ej,  [a,f),  (b.fj,  (e.f],  (b.c),  (d,c),  (dfe)  j 
G  *  (V,E) 

Figure  2.3.  Mesh  Network  Representation  Using 
Oirected  Graphs 
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Example:  The  mesh  network  shown  in  Figure  2.3.  is  represented  as 

G-(  V.B) 

where  V  -  { a,b,c,d.e/ } 

and  V  x  V  -  { (a,b,c,d,e/)  I  a.b.c.d.e/  e  V  } 

and  EC(VxV) 

E  -  { (a.b).  (a,el,  la/1,  lb/1,  (e/1,  (b.c),  (d.c).  (d.el } 

It  is  dear  that  (b,c)  indicate  a  unidirectional  link  from  b  to  c,  and  [a.bl 
indicate  a  bidirectional  link  between  a  and  b. 

C.  PROPOSED  DSN  INT&LECTUAl  ASSETS 

In  the  literature,  data  and  information  are  used  interchangeably. 

Actually,  data  and  information  are  not  synonymous.  There  is  a  distinction 
between  them.  Data  become  information  only  when  they  are  useful  and 
available.  This  means  that  information  is  produced  as  output  of  data 
processing  operations  and  used  to  enhance  understanding  and  to  achieve 
specific  purposes  (Ref.  8J.  Figure  2.4,  shows  a  pictorial  view  of  a  proposed  . 
DSN  intellectual  assets  shown  as  a  four  layer  pyramid.  The  bottom  layer  is 
the  data  collected  by  sensors,  which  are  processed  using  a  data  processing 
mechanism.  The  results  of  the  data  processing  are  information,  which  is  the 
second  layer,  and  it  is  apparently  the  processed  data.  The  information  added 
to  the  expertise  of  a  human  expert,  resulting  in  knowledge,  which  is  the 
third  layer.  The  knowledge  is  used  for  developing  the  knowledge-base, 
which  is  the  core  of  the  expert  system.  The  expert  system  is  used  for  aiding 
the  controller-decisionmaker  in  giving  intelligent  decisions,  which  is  the 
utmost  purpose  of  the  DSN.  It  is  noted  that  the  human  expert  is  kept  in  the 
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loop  from  time  to  time  to  adapt  the  knowledge-base  to  cope  with  the 
changes  in  any  situation  affecting  the  environment  sensed  by  the  sensors. 

Based  on  that,  the  result  of  processing  data  at  each  node  taken  by  its  own 
sensor  or  sensors  is  local  information,  which  is  tied  or  fused  with  the 
incoming  information  from  neighboring  nodes  to  obtain  an  updated  situation 
assesment  at  each  node  and  forming  a  coherent  picture  that  resembles  as 
closely  as  possible  what  is  happening  in  the  local  area  of  interest. 
Consequently,  we  have  termed  the  bringing  together  of  locally  processed 
data  from  different  neighboring  nodes  into  a  coherent  picture  "  information 
fusion'. 

In  fact,  by  information  fusion  we  also  mean  to  include  ail  sources  of 
information,  not  just  that  from  processing  electromagnetic,  acoustic,  optical 
and  infrared  sensors  data.  There  are  for  example  in  defense  systems,  human 
observers  providing  intelligence  information  and  a  background  of 
encyclopaedic  information  and  operational  plans  (Ref.  91.  Also,  fusing  local 
information  and  incoming  information  from  neighboring  nodes  can  be  done 
only  after  deciding  that  they  are  originated  only  from  the  same  origin.  So, 
the  fusion  process  implies  a  decision  process. 
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Figure  2.4.  Pictorial  view  of  a  ProDOseu 
DSN  intellectual  Assets 
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0.  INFORMATION  ELEMENTS  OVERLAPPING  IN  HEA 

Tbe  principle  of  inclusion  and  exclusion  is  adopted  with  slight 
modifications  to  represent  the  information  overlapping  of  different  nodes  in 
HEA  (Ref.  7:pp.  190-2131.  Let  S  be  a  set  with  N-ISI.  where  ISI  is  the 
cardinality  or  size  of  S.  i.e.  denotes  the  number  of  elements  of  S  (in  MMT 
case  it  can  be  the  number  of  tracks).  Let  q,  c2 . q  be  a  collection  of 

conditions  or  properties  satisfied  by  some,  or  alL  of  the  elements  of  S.  Some 
elements  of  S  may  satisfy  more  than  one  of  the  conditions  (e.g.  range  and 
bearing  in  MMT  case),  while  some  may  not  satisfy  any  of  them.  For  l  £  i  i  t. 
N(q)  will  denote  the  number  of  elements  in  S  that  satisfy  condition  q  For  i,  j 

€  ( 1.2.3 . ,t).  if),  N(qq)  wiil  denote  the  number  of  elements  in  S  that 

satisfy  both  of  the  conditions  q.q.  N(qq)  does  not  count  the  elements  of  S 
that  satisfy  only  one  of  the  conditions  q  and  Cj.  If  1  t  i,  j.  k  t 1  are  three 
distinct  integers,  then  N(qqck)  denotes  the  number  of  elements  in  S 
satisfying  each  of  the  conditions  q.  q  and  ck  (e.g,  range,  bearing  and  velocity 
in  MMT  case).  For  1  5  i  *  t.  N(q)  denotes  the  number  of  elements  in  S  that 
do  not  satisfy  condition  q.  In  this  case,  N(q)  -  N-  N(q).  If  I  i  i.  j  5 1.  \f\, 
N(qq)  -  the  number  of  elements  in  S  that  do  not  satisfy  both  of  the 
conditions  q  and  q. 
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FIGURE  2.5.  Two  Information  Elements  Inclusion  and  Exclusion 


From  the  Venn  diagram  in  Figure  2.5,  it  is  seen  that  if  N(q)  denotes  the 
number  of  elements  in  the  left-hand  circle  and  N(cj)  denotes  the  number  of 
elements  in  the  right-hand  circle,  then  N(qcj)  is  the  number  of  elements  in 
the  overlap,  while  N(qq)  counts  the  elements  outside  the  union  of  these 
circles.  Consequently,  N(cjCj)  -  N  -  (N(q)  *  N(q))  ♦  NfcjCj),  where  the  last  term 
is  added  ou  since  it  was  eliminated  twice  in  the  term  (N(q)  ♦  N(Cj)J.  In  like 
manner,  from  Figure  2.6. 

N(C|CjCt)  -  N  -  (N(cj)  ♦  N(q)  ♦  N(ck)l  ♦  (N(cjCj)  ♦  Nfcjc*)  ♦  N(CjC*)l  -  Nfc^) 
Generally,  the  number  of  elements  of  S  that  satisfy  none  of  the  conditions  q, 
1  ■  i  i  t,  is 

N(c,C2C3 . q)  -  N  -  (N(cj)  ♦  N(c2)  ♦  N(cj)  ♦ . ♦  N(q)| 

♦  (N(C|C2)  ♦  N(C|Cj)  ♦ . ♦  N(c,q)  ♦  Nfc^)  ♦ . ♦  N(q_,q)l 
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-  (N(C|C2C3)  ♦  N(C,C2C4)  ♦ . ♦  N(C,C2Ct)  ♦  N(C,C3C*)  * . 

♦  N(c,c3q)  ♦ . ♦N(ct_2ct_,ct)l  ♦  ••••  ♦  ("1  )iN(c,c3c4  —  q). 


or 

N(C|C2C3 . Ct)-N-  £N(Cj)*  £N(CjCj)-  5TN(C|CjCk>  -  . 

I  <i<t  i_ii<j<t  !o<i<ka 

♦  (- 1  )*  N(C|C3C4 . Ct)  (2.4) 


N(C|C,ck) 


FIGURE  2.6.  Three  Information  Elements  Inclusion  and  Exclusion 

Nov,  assuming  the  number  of  elements  in  S  that  satisfy  exactly  m  of  the  t 
conditions  is  em  where  1  i  m  5 1,  then 

ej  -  N(C|C2c3 . ct)  ♦  N(c(c2c3 . q)  + . ♦  N(cjc2c3 . q),  (2.3 ) 

and 
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«2  -  N(C|CjCj . .q)  ♦  N(c,CjCj . Ct)  ♦ . ♦  N(C,C2C3 . 


(2.6) 


FIGURE  2.7.  Information  Elements  Inclusion  for  t-3 

Using  these  results  as  a  starting  place,  as  Figure  2.7  is  examined,  where 
t-3.  a  numbered  condition  is  placed  beside  the  circle  representing  those 
elements  of  S  satisfying  that  particular  condition.  Then  et  equals  the 
number  of  elements  in  region  2,  3.  4.  But  it  can  be  written  that 
e,  -  N(c,)  ♦  N(c2)  ♦  N(cj)  -  2!N(c,c2)  -  N(c,c3)  ♦  N(c2c3)l  ♦  3N(c,c2c3)  (2.7) 

In  N(cj )  ♦  N(c2)  ♦  N(c3)  the  elements  in  regions  5,  6,  and  7  are  counted  twice 
and  those  in  region  8  are  counted  three  times.  In  the  next  term  the 
elements  in  regions  3.  6.  and  7  are  deleted  twice.  The  elements  in  region  8 


29 


2L  : 


are  removed  sis  times  in  2(N(cjC2)  ♦  N(c,c3)  ♦  NtcjCj)),  so  the  term  3N(cjCjc3) 

is  added  and  end  up  not  counting  the  elements  in  region  8  at  all.  Hence. 

«,  •  S,  -  2Sj  ♦  3Sj  -  S'  -  (  f)Sj  ♦  (  j’JSj  (2.8) 

for  e2 .  the  earlier  equation  indicates  that  it  is  needed  to  count  the 

elements  of  S  in  regions  5.  6.  and  7.  From  the  Venn  diagram 
02  -  N(c,c2)  ♦  N(CfC?)  ♦  N(C2C3)  -  3N(CjC2C3) 

»  S2  -  3S3  -  S2  -  ( |)S3  (2.9) 

and 

e3  -  Nicjc^)  -  S3  (2.10) 

Generally,  for  any  1  5  m  S  t.  the  number  of  elements  in  S  that  satisfy  exactly 

m  of  the  conditions  c,.  c2,  c3 . c*  is  given  by 

e.  -  S,  •  (  7‘ )S„,  *(  72)S».2  - . *  (-1  )<-•  (t.'m)S,  (211) 

E.  COMMUNICATIONS  BETWEEN  DIFFERENT  NODES 

To  enable  communications  between  different  nodes  (processors)  the 
following  items  must  be  considered  (Ref.  10) : 

1.  The  electrical  and  physical  characteristics  of  the  medium  chosen  for 
the  interconnection. 

2.  The  signalling  used  to  ensure  the  reliable  transmission  and  reception 
of  data. 

3.  The  means  of  effecting  flow  control  in  order  to  align  the  rate  of  data 
exchange  with  the  processing  capabilities  of  the  machines. 

Actually,  when  two  information  nodes  communicate  across  a  network,  a 

network  access  protocol  between  each  node  and  network  is  needed.  The 

requirements  for  such  a  protocol  differ  significantly  for  a  communication 

networking  technique  to  another.  By  protocols,  we  mean  the  set  of  rules, 

procedures  and  conventions  by  which  information  is  transported  through  the 
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computer  communications  network.  The  key  elements  of  a  protocol  are  [Ref 
111: 

1.  Syntax:  includes  such  things  as  information  format  and  signal  levels. 

2.  Semantics:  includes  control  information  for  coordination  and  error 
handling. 

3.  Timing:  includes  speed  matching  and  sequencing. 

F.  MAIN  COMPONENTS  OF  HEA 

Based  on  the  previously  mentioned  distributed  estimation  approach. 
Figure  2.8,  shows  a  pictorial  generic  view  of  the  main  components  of  the 
Horizontal  Estimation  Architecture  (HEA).  By  architecture  we  mean  the  set 
of  algorithms,  rules,  conventions  and  protocols  that  implement  the  horizontal 
estimation  functions  and  their  interrelationships.  The  introduced  HEA  has 
four  major  components,  the  local  estimator,  the  information  fusion  process 
(both  together  are  called  Horizontal  Estimator),  the  network  access  protocols, 
and  the  controller -decisionmaker. 

The  control  policy  used  is  decentralised,  with  control  decisions  made 
independently  by  each  node  using  the  estimates  of  the  states  resulting  from 
the  information  fusion  process.  The  expected  value  of  a  quadratic 
performance  index  is  minimized  to  assure  maximum  system  robustness. 

That  is,  gradual  or  partial  system  failures  would  have  minimum  degradation 
in  the  performance.  Also,  it  avoids  the  development  of  complex  strategies 
which  are  difficult  and  costly  to  achieve  and  implement. 
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FIGURE  2.8.  Pictorial  View  of  HEA 
Major  Components 


32 


$  -  Sensor 

HE  *  Horizontal  Estimator 
C/0  -  Controller/Oecisonmaker 

FIGURE  2.9.  Distributed  Sensors  Network  (DSN) 
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Gl  proposed  functional  elements  of  a  dsn 

Based  on  the  HEA.  we  view  the  DSN  to  be  represented  as  shown  in  Figure 
Z9,  by  five  basic  functional  elements :  the  environment,  sensors,  the 
Horizontal  estimator,  data  communications  network,  and  controller- 
dectsionmaker 

Each  sensor  complex,  horizontal  estimator,  and  controller -decision maker 
is  sited  in  a  node,  which  we  call  an  information  node  10  differentiate  it. 
irom  a  communication  node.  Throughout  the  sequel,  node  is  used  to  indicate 
the  information  node.  Each  information  node  is  connected  through  a 
communications  node  and  an  appropriate  data  link  to  its  nearest  neighboring 
nodes,  thus  providing  a  mesh  network  topology.  As  mentioned  before,  each 
information  node  performs  processing  functions  by  local  estimator  using  the 
local  sensor* sensors)  data,  communicating  the  processing  results  toother 
neighboring  nodes,  in  addition  to  using  it  locally.  The  information  fusion 
process  fuses  the  information  received  from  neighboring  nodes  with  the  local 
information  The  HEA  can  be  further  eitended  to  cover  a  very  large  scale 
DSN  in  the  case  when  each  node  can  be  the  central  processor  of  a  centralised 
network,  and.  or  a  global  estimator  (parent  node)  of  a  hierarchicai  network 
'Figure  2.10* 

H  MOTIVATIONS  TO  HEA 

In  HEA,  the  approach  of  "divide  and  conquer  which  is  useful  for  most 
complex  problems  can  be  easily  used.  Partitioning  allows  any  large  complex 
system  to  be  divided  into  manageable  portions.  Another  motivation  to  the 
HEA  is  that  network  splitting  and  reformation  or  connection  of  additional 
compatible  networks  are  practicable  during  system  operation  and  do  not 
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cause  any  restriction  as  a  new  system  is  initiated.  Additional  advantages  of 
this  design  approach  are,  usage  of  microcomputer  systems,  which  provide  a 
cost-effective  solution  for  data  processing,  reliability,  survivability,  iocai 
autonomy,  heterogeneous  feature,  low  cost  communications  and  its 
practicability  for  already  existing  information  nodes,  which  need  to  be  tied 
together.  The  partitioning  approach  used  in  HEA  allows  large  complex 
systems  to  be  divided  into  manageable  proportions. 
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III.  OVERVIEW  OP  TRACK-WHILE-SCAN  SYSTEMS 


A.  TWS  RADAR  SYSTEM  CONCEPT 

The  process  erf  tracking  targets  based  on  discrete  radar  information 
obtained  while  the  radar  continues  to  scan  the  airspace  is  referred  to  as  the 
Track-While-Scan  (TWS)  processlRef.  121,  and  is  accomplished  in  a  digital 
computer  in  modern  day  surveillance  radars.  For  a  TWS  system,  search  and 
track  update  functions  are  simultaneously  performed.  In  TWS.  a  single 
sensor  scanning  at  a  constant  rate  illuminates  new  targets  and  targets 
already  in  track  files  at  the  same  time.  Also,  only  those  target  tracks  that 
remain  within  the  TWS  search  volume  can  be  maintained.  In  an  automatic 
TWS  air-surveillance  system,  the  radar  sensor  reports  measurements  of 
target  observations  at  regular  intervals  of  time  to  a  computer,  which  then 
assembles  the  observations  from  successive  scans  into  tracks.  The  computer 
software  must  correctly  associate  new  plots  with  the  existing  tracks  and 
initiate  new  tracks  from  reports  received  on  air  targets  within  range  of  the 
radar.  The  association  task  is  aided  by  tracking  filters  which  combine  noisy 
measurements  with  track  predictions  to  obtain  smoothed  updated  track 
estimates.  The  predicted  position  of  the  target  for  the  next  radar  scan,  based 
on  the  smoothed  estimate  of  the  current  position  of  the  target,  is  used 
together  with  the  estimated  standard  deviation  of  the  prediction  to 
determine  the  location  and  size  of  the  region  of  acceptability  of  new 
observations  on  the  target.  The  tracking  filter  thus  plays  an  essential  role  in 
the  function  of  plot-  to-  track  association,  in  addition  to  its  role  of  providing 
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accurate  estimates  of  the  position  and  motion  of  the  target  In  TWS  only 
position  and  velocity  states  are  estimated  because  with  low  update  rates 
( like  every  4  seconds),  the  noisy  acceleration  estimates  would  oat  contribute 
very  much.  Typically,  in  intercept  problems,  the  missile  require*  only  line  of 
sight  rates  for  deployment. 

For  the  TWS  system,  both  search  and  track  update  are  done 
simultaneously.  At  the  end  of  the  scan  interval,  all  observation*  received 
during  the  scan  are  correlated  with  the  existing  tracks.  An  operational  air- 
surveillance  system  must  be  capable  of  tracking  many  target* 
simultaneously  in  an  environment  that  may  provide  large  numbers  of  false 
.arget  indications  due  to  real  and  artificial  clutter  as  well  as  system  noise. 

The  measurement  accuracy  obtained  and  the  tracking  precision  of  such 
systems  may  not  demand  the  most  computationally  complex  filtering 
operations.  It  is  essential  that  the  filtering  operations  give  sufficient  support 
to  the  association  procedures;  any  complexity  beyond  that  and  not  necessary 
for  this  task  is  of  diminishing  vaiue.  It  is  important  however  that  the  filter 
be  sufficiently  flexible  to  adjust  quickly  to  changes  in  the  tracking 
environment.  The  literature  on  the  techniques  of  track  filtering  is  very  large 
and  diverse.  It  is  important  to  use  3impie,  easily  implemented,  and 
computationally  inexpensive  filters  which  nevertheless  retain  as  far  as 
possible  the  features  of  optimality  and  flexibility  which  the  most  general  and 
expensive  forms  embody. 

3y  automating  the  target  detection  and  tracking  process  (Automatic 
Detection  and  Tracking)  which  is  called  ADT.  the  bandwidth  m  the  output  of 
a  radar  is  reduced  so  as  to  allow  the  radar  data  to  be  transmitted  to  another 
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location  via  narrowband  communication  links  rather  than  wideband 
communication  links  [Ref.  1 2,13).  This  permits  the  outputs  of  many  radars 
to  be  intercommunicated  economically. 

B.  TVS  RADAR  FUNCTIONS 

The  signal  processor  determines  the  presence  or  absence  of  targets  while 
rejecting  unwanted  signals  due  to  ground  clutter,  sea  clutter,  weather,  radio¬ 
frequency  interference,  noise  sources  and  intentional  jammers.  It  is 
performed  by  coherent  and/or  non-coherent  processing  of  time  samples  of 
the  received  signals  (Ref.  131.  The  signal  processor  is  implemented  in  real¬ 
time  special  purpose  hardware.  Basic  operations  that  are  now  routinely 
exploited  as  a  result  of  the  advances  in  digital  technology  (Ref.  14 1.  include: 

1.  Pulse  Compression  (PC) 

2.  Pulse  Doppler  Processing  (PDP) 

3.  Moving  Target  Indicator  (MTI) 

4.  Moving  Target  Detector  (MTD) 

3.  Constant  False  Alarm  Rate  (CFAR)  circuit 


39 


Reduction  Of  The  Dete  Flow 


Signal 

processor 


Raw  Video 


Track 

Information 


Video  Detection  Track  Inf  or. 


t 


Tabular  Display 


Enhancement  Of  Target  Information  Through 
Different  Processing 
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Tbe  data  extractor  provides  the  target  measurements  in  range,  angles 
(azimuth,  elevation),  radial  velocity  and  possibly  provides  target  signature. 
In  general,  a  target  may  cause  several  detections  in  adjacent  cells  in  range. 
Doppler  frequency  and  angles.  Tbe  centroid  of  tbe  corresponding  pattern  of 
detections  gives  an  estimate  of  tbe  target  measurements.  Tbe  data  extractor 
is  generally  implemented  with  a  dedicated  microcomputer. 

The  data  processing  is  performed  on  a  digital  computer  inserted  between 
the  plot  extractor  and  immediately  before  the  display.  It  can  be  defined  as 
the  set  of  algorithms  which,  when  applied  to  the  radar  detections  acquired 
during  successive  scans,  allows  the  following: 

1 .  Recognition  of  a  pattern  of  successive  detections  as  pertaining  to  the 
same  target. 

2.  Estimation  of  the  kinematic  parameters  (position,  velocity,  and 
acceleration)  of  a  target,  thus  establishing  a  so-called  "  target  track  ". 

3.  Extrapolation  of  the  track  parameters. 

4.  Distinguishing  of  different  targets  and  thus  establishing  a  different 
track  for  each  target. 

3.  Distinguishing  of  false  detections  (caused  by  intentional  or  natural 
interference)  from  true  targets. 

6.  Adaptive  refinement  of  the  threshold  setting  of  the  signal  processor 
in  order  to  make  tne  radar  more  or  less  sensitive  in  the  different 
spatial  directions,  depending  on  the  content  of  a  map  of  false 
detections  refreshed  on  a  scan-to-scan  base. 

As  shown  in  Figure  3.1.  it  is  important  to  emphasize  that  the  cascade  of 
signal  processor,  data  extractor,  and  data  processor  is  ultimately  a 
bandwidth  compressor.  It  receives  data  at  a  high  rate  and  processes  the 
signal  in  such  a  manner  that  a  relatively  low  data  rate  is  achieved.  This 
feature  is  pictorially  indicated  by  the  nat  rowing  of  the  arrows  moving  from 
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the  left  to  the  right  of  the  cascaded  processors.  At  the  same  time,  there  is  a 
progressive  discrimination  between  useful  and  clutter  data,  by  means  of  a 
stepwise  decision  process.  The  information  handled  by  the  processing  hain 
is  progressively  manipulated  into  a  form  which  allows  easier  decision 
making  by  the  user.  In  fact,  the  raw  video  signal  contains  many  false 
echoes.  The  data  ei tractor  isolates  the  useful  target  and  the  data  processor 
identifies  the  target  (  possibly  labelled  with  a  code),  determines  the  target 
velocity  and  additional  parameters  which  are  presented  in  a  tabular  display. 
A  further  observation  can  be  made  regarding  the  increase  of  the  time  span 
in  which  processing  is  performed  through  the  cascade.  The  signal  processor 
involves  only  a  few  pulses,  the  data  extractor  some  adjacent  groups  of  pulses 
and  the  data  processor  consecutive  radar  scans.  In  other  words,  the  memory 
of  the  processing  increases  on  moving  from  left  to  right  in  Figure  3.1. 

So.  these  TWS  systems  are  excellent  candidates  for  the  application  of  HEA. 
since  they  are  already  have  their  local  estimation  processors  which  perform 
the  functions  of  track  initiation,  plot-track  correlation,  track  prediction,  track 
filtering,  and  track  termination. 

C  RADAR  OUTPUT 

The  output  of  a  TWS  radar  is  generally  a  display  to  visualise  the 
information  contained  in  the  radar  echo  signal  in  a  form  suitable  for  operator 
interpretation  and  action.  The  visualised  information  on  the  display  is  called 
synthetic  video  '  in  contrast  to  the  so  called  "  raw  video  '  which  is  the 
information  shown  when  the  display  is  connected  directly  to  the  video 
output  of  the  receiver  [Ref.  14,151.  Synthetic  displays,  whilst  being  a 
processed  representation  of  the  signals  which  are  being  received  by  the 
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radar  sensors,  Have  the  advantage  that  all  responses,  irrespecuve  of  the 
weakness  of  the  returned  signal,  can  be  displayed  at  a  constant  level  of 
brilliance  and  clarity.  The  Plan  Position  Indicator  (PPI),  the  usual  display 
employed  in  radar,  indicates  range  and  azimuth  of  a  detected  target.  The 
idea  of  tracking  is  easily  visualized  if  successive  scans  containing  a  moving 
target  are  superimposed,  the  target  then  gives  rise  to  a  fairly  regularly 
spaced  sequence  of  returns. 

in  the  past,  an  operator  manually  marked  the  location  of  the  target  at 
each  scan  with  a  grease  pencil  on  the  face  of  PPI.  This  procedure  was  very 
simple  but  offered  poor  accuracy  and  simultaneous  processing  of  only  a  few 
targets,  due  to  operator  fatigue.  The  limitations  of  the  operator  have  been 
overcome  by  resorting  to  a  computer  which  automatically  performs  the 
whole  tracking  process.  This  computer  has  been  referred  to  as  the  ”  data 
processor  ’  in  Figure  3.1.  In  order  to  design  an  automatic  procedure  to  track 
one  or  more  targets,  it  is  convenient  to  examine  the  nature  of  the  plot 
sequence  provided  by  TWS  radar.  The  better  the  expected  properties  of  the 
sequence  can  be  defined,  the  greater  is  the  ability  of  the  tracker  to 
distinguish  among  different  targets  and  false  plots.  False  plots  are  caused  by 
clutter,  intentional  interference  and  noise  which  survives  the  action  of  signal 
processing.  The  spacing  of  the  consecutive  target  plots  is  caused  by  the 
target  velocity  which  may  vary  in  time  as  the  target  executes  various 
maneuvers.  In  the  case  where  the  target  is  an  aircraft,  upper  and  lower 
limits  can  be  placed  on  the  magnitude  of  the  velocity.  Further,  an  upper 
limit  on  the  magnitude  of  the  acceleration  of  the  aircraft  usefully  restricts 
the  possible  tracks  the  aircraft  can  make. 
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D.  CLASSES  OF  TRACKS 

In  a  modern  TWS  system,  a  set  of  computer  software  establishes  and 
maintains  a  number  of  files  pertaining  to  three  different  classes: 

1.  Firm  tracks;  a  firm  track  occurs  when  the  path  of  a  target  has  been 
acquired  by  the  data  processor  and  the  kinematic  parameters 
estimated  with  sufficient  accuracy. 

2.  Tentative  tracks;  a  tentative  track  corresponds  to  the  first  phase  of 
the  track  acquisition  of  target  or  clutter  plots. 

3.  Stationary  tracks;  a  stationary  track  pertains  to  clutter,  since  its 
position  does  not  significantly  change  from  scan  to  scan. 

The  files  are  produced  by  processing  the  plots  received  from  the  radar  on 
a  scan- to- scan  basis  and  stored  in  a  buffer  as.  as  outlined  in  Figure  3.2.  The 
contents  of  the  output  buffers  are  periodically  updated  and  displayed  to  the 
operator.  A  firm  track  occurs  when  the  path  of  a  target  has  been  aquired  by 
the  data  processor  and  the  kinematic  parameters  estimated  with  sufficient 
accuracy.  By  contrast,  a  tentative  track  corresponds  to  the  first  phase  of  the 
track  acquisition  of  target  or  clutter  plots.  Finally,  a  stationary  track 
pertains  to  clutter,  since  its  position  does  not  significantly  change  from  scan 
to  scan.  The  tracks  have  been  divided  into  different  classes  because  their 
corresponding  processing  and  utilization  differ. 


44 


E  THE  CLUTTER  MAP 

Because  an  operational  environment  may  be  affected  by  a  large  number 
of  false  target  indications  caused  by  fixed  and  moving  clutter  as  well  as 
system  noise,  a  particular  attention  must  be  paid  to  filtering  out  these 
disturbances.  This  is  carried  out  by  the  formation  and  updating  of  a  clutter 
map,  which  identifies  spurious  plots  from  the  input  buffer  so  that  the 
remaining  plots  pertain  only  to  valid  targets.  The  use  of  the  clutter  map 
produces  some  benefits  to  a  radar  operating  in  a  clutter  environment: 

1.  It  allows  proper  selection  of  signal- processor  operating  modes  (  e.g. 
the  MTI  is  switched  on  only  where  it  is  needed,  thus  avoiding 
unnecessary  detection  performance  degradation  in  a  clear 
environment,  the  thresholds  of  the  detection  logic  are  controlled 
according  to  the  residual  clutter  power). 

2.  It  allows  removal  of  clutter  points  from  the  radar  plot  buffer,  thus 
preventing  saturation  of  the  processing  and  the  formation  of  spurious 
tracks  which  reduce  confidence  in  the  track  data.  The  clutter  map 
updating  process  is  also  referred  to  as  a  Stationary  Track  Filter  (STF)  or 
Scan  to  Scan  Correlator  (SSC).  IRef.  17). 

The  STF  acts  as  a  velocity  discriminator  upon  the  plot  stream  coming 
from  the  data  extractor.  Plots  which  appear  to  be  stationary  or  slowly 
moving  from  scan  to  scan  are  stored  in  the  clutter  map.  Each  new  incoming 
plot  is  compared  with  the  content  of  the  map,  if  the  plot  falls  within  a 
defined  capture  area  around  one  of  the  clutter  map  entries,  then  the  new 
plot  is  deemed  to  be  clutter.  As  a  consequence  .  it  updates  the  clutter  map 
content  and  is  deleted  from  the  radar  plot  buffer.  STF  can  also  be  considered 
as  a  device  capable  of  forming  cancellation  masks  centered  around  targets 
(true  or  spurious)  with  a  velocity  below  an  established  threshold  value. 
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The  velocity  discrimination  is  performed  by  comparing  the  plot 
displacement  in  two  or  more  of  the  subsequent  scans  with  respect  to  a  gate 
centred  around  the  first  position  (already  measured)  of  the  clutter  map 
entry.  It  is  obvious  that,  for  clutter  points  or  slow- moving  targets,  the 
subsequent  echoes  remain  in  the  gate  for  a  large  number  of  scans,  but  for 
fast  targets,  the  echoes  are  in  the  gate  for  only  few  scans.  By  computation  of 
the  scans  required  by  the  target  to  leave  the  gate,  it  is  possible  to  determine 
its  velocity. 

F.  OPERATIONAL  REQUIREMENTS 

The  operational  requirements  for  data  processing  vary  with  the  type  of 
application.  Typical  requirements  are  estimation  accuracy,  extrapolation 
time  and  system  reliability  (Ref.  17).  In  maritime  collision- avoidance 
systems  it  is  necessary  to  detect  potential  collisions  well  in  advance  ( 1 5-30 
minutes)  because  of  long  reaction  times,  especially  for  large  ships  such  as  oil 
tankers.  The  estimation  error  on  the  forecast  position  must  be  lower  than 
one  nautical  mile.  Therefore  the  accuracy  by  which  the  velocity  is  estimated 
must  be  within  1  knot  This  requirement  determines  the  degree  of  filtering 
to  be  applied  taking  into  account  the  measurement  error  and  the  data  rate  of 
the  radar. 

In  the  case  of  ATC  the  radar  information  may  support  the  function  of 
tactical  control,  conflict  alert  and  approach  control.  In  tactical  control,  the 
radar  controller  checks  the  current  positions  of  the  aircraft  to  maintain  the 
standard  separations.  Whenever  the  separations  tend  to  be  violated,  the 
radar  controller  advises  the  piiots  involved  about  the  local  trajectory 
modifications  required  to  re-establish  acceptable  conditions.  In  conflict 
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alert,  the  processing  system  estimates  the  forecast  positions  of  all  the 
aircraft  to  determine  the  conflicting  pairs  of  aircraft;  the  extrapolation  time 
is  from  1  to  2  minutes.  The  processing  system  can  also  evaluate  the 
trajectory  modifications  for  one  or  both  aircraft  to  resolve  the  conflict 
situation.  In  this  case,  one  nautical  mile  may  also  be  assumed  as  an 
acceptable  accuracy  in  the  estimation  of  forecast  position.  Of  course,  possible 
maneuvers  during  the  extrapolation  time  may  be  taken  into  account  in  the 
evaluation  of  the  conflict  area.  Therefore,  where  two  aircraft  are  conflict 
free  when  flying  straight  and  could  be  taken  into  conflict  if  one  or  both 
maneuver,  an  order  may  be  given  to  both  aircraft  to  keep  the  straight 
trajectory  constant.  In  approach  control,  the  controller  checks  that  aircraft 
follow  fixed  paths  to  assure  a  safe  landing.  A  higher  accuracy  than  the 
preceding  one  is  required  here,  i.e.  a  fraction  of  one  nautical  mile. 

In  an  air-defence  system,  the  estimated  trajectory  is  generally  used  to 
help  perform  some  of  the  following  functions; 

a.  threat  identification 

b.  threat  evaluation 

c.  calculation  of  the  forecast  position  (for  fire  or  launch  of  missile) 

d.  weapon  assignment 

e.  kill  evaluation 

The  functions  a  to  e  do  not  necessarily  use  the  measurements  of  one  single 
radar.  For  example  the  fire  control  function  c  is  generally  performed  by  a 
tracking  radar  with  the  characteristics  of  good  accuracy  and  high  data  rate. 

It  may  be  noted  that  in  ATC,  all  the  trajectories  are  easy  to  follow 
because  of  path  regularity,  low  acceleration  (2-3  m/s2)  and  pilot 
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collaboration;  whereas,  in  air  defence,  targets  have  high  accelerations  (10-50 
m/s2)  and  manoeuvres  that  are  intentionally  evasive.  Furthermore,  it  is 
important  to  have  a  very  short  reaction  time,  especially  for  targets  detected 
at  short  range  such  as  low-flying  aircraft  and  sea  -skimmer  missiles. 
Generally,  the  data  processing  from  a  radar  extractor  is  more  difficult  for 
air-defence  systems  than  that  for  civil-traffic  control,  because  the  target 
acceleration  is  high  and  unpredictable.  In  addition,  jammers  interfere  with 
the  proper  working  of  the  system.  "The  effects  of  these  phenomena  are 
reduced  by  radar  signal  processing  using  particular  devices,  such  as  moving- 
target  detector  and  multisidelobe  canceller,  which  limit  the  false  detections. 

G.  NETTED  RADAR  SYSTEMS 

Much  recent  interest  has  centered  around  radar  netting.  Estimation  of 
location,  velocity  and  maneuver  together  with  possible  identification  of  each 
relevant  target  can  be  provided  by  Radar  Data  Processing  (RDP)  with  an 
accuracy  and  reliability  greater  than  that  available  from  a  single  look  radar 
report.  Today,  it  is  also  very  important  to  net  different  types  of  sensors  in 
addition  to  radars  in  order  to  enhance  performance.  The  current  various 
types  of  radar  networks  can  be  classified  according  to  the  level  at  which  the 
merging  of  data  is  taking  place  into  the  two  following  classes: 

1.  centralized 

2.  distributed 

The  centralized  architecture  (see  Figure  3  3)  is  characterised  by  the  use  of 
a  single  data  processor  to  which  radar  plots  are  transmitted  from  radar  sites. 
These  measurements  are  processed  so  as  to  obtain  a  single  multiradar  track 
for  each  target.  This  architecture  has  the  features  of  . 
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».  using  tracking  algorithms  with  variable  data  rate. 

b.  reducing  tracking  errors  because  of  the  higher  data  rate  than  with  a 
single  radar. 

c.  requiring  more  powerful  processing  resources. 

d.  being  the  prevalent  type  in  military  applications  owing  to  the  higher 
accuracy  gained. 

The  distributed  architecture  (see  Figure  3-4)  is  characterised  by  the  use  of 
a  computer  at  each  site  performing  the  tracking  function  on  the 
measurements  of  a  single  radar.  The  monoradar  tracks  instead  of  radar 
plots  are  then  transmitted  to  a  single  data- processing  centre  which  combines 
them  in  order  to  establish  a  single  multiradar  track  for  each  target  The 
following  features  are  relevant  for  this  architecture: 

-  it  is  capable  of  local  operation. 

-  it  requires  computing  resources  of  limited  power. 

-  it  mainly  requires  monoradar  processing  algorithms. 

-  it  prevails  in  ATC  because  of  simplicity,  reliability  and  growth  capability. 
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FIGURE  3  3-  Centralized  Architecture  of  a  Radar  Network 
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FIGURE  3 A-  Distributed  Architecture  of  a  Radar  Network 


The  proposed  HEA  as  a  distributed  architecture  applied  for  radar  netting 
would  be  convenient  and  cost  effective  to  be  used  specially  for  modern  TWS 
radars  whicli  have  their  local  tracking  systems  already  installed  and  it 
guarantees  the  maximum  utilization  of  the  local  resources  of  each  radar  site 
and  taking  advantage  of  the  advanced  techniques  taking  place  in  each  major 
block  oi'  the  radar  systems  (Figure  3.5).  The  problem  of  the  lack  of 
experienced  decision  makers  which  lead  to  the  centralization  of  decision 
making  can  be  solved  by  the  use  of  expert  systems  and  encapsulate  the 
decision  maker  expertise  in  computer  software. 
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FIGURE  3.5.  Advanced  Techniques  in  the 
Mam  Blocks  of  a  Radar  System 
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IV.  HEA  APPLICATION  TO  MMT  PROBLEMS 

A.  OVERLAPPING  COVERAGE  OF  A  RADAR  NETWORK 

The  HEA  is  applied  to  the  Multitarget  Multisensor  Tracking  (MMT) 
problems  in  a  tactical  air  surveillance  system.  The  problem  is  to  produce  a 
statistically  meaningful  estimate  of  both  the  number  of  the  targets  present, 
and  of  their  trajectories.  It  is  assumed  that  the  system  employs  Track* 
While-Scan  (TWS)  radars  with  different  scan  rates  and  with  overlapped 
coverage.  The  degree  of  overlap  of  the  radar  coverage  is  defined  is  as 

N 
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Where  Aj  is  the  area  covered  by  the  j  radar 

Atot  is  the  total  surveillance  area  controlled  by  the  radar 

N  is  the  number  of  radar  sites(nodes) 

The  parameter  8  ranges  from  1  (absence  of  overlap)  to  N  (total  overlap). 

Figure  4.1,  shows  an  example  of  overlapping  coverages  between  three 
radars.  R1 ,  R2 .  R3 .  If  the  degree  of  overlap  is  very  small,  the  advantages  of 

the  data  redundancy  ars  limited, on  average,  to  small  areas  and  few  targets. 
Assuming  that  the  targets  are  evenly  distributed  in  A  with  density  6,  then  it 
follows  from  equation  (  4 A  )  that 
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(4.3) 


N 
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Where  n  is  the  number  of  targets  in  the  area  A 

J  J 

ntot  is  the  number  of  targets  in  the  totai  area  A 


FIGURE  4.1.  Example  of  Overlapping  Coverage  Between 
3  Radars  in  a  Network 

Generally,  the  most  obvious  exploitation  of  additional  radar  sites  is  to 
extend  the  coverage  beyond  the  maximum  range  of  a  single  radar,  as 
established  either  by  line  of  sight  or  by  radar  power.  Also,  the  viewing  of  a 
target  from  different  aspects  angles  tends  to  reduce  target  fades,  glint  and 
terrain  masking  effects.  It  is  assumed  that  radar  sites  will  be  primarily 
chosen  with  the  aim  of  optimal  radar  coverage  of  the  entire  network.  We 
consider  the  multitarget  and  multisensor  problems  separately,  and  then 
merge  the  two.  This  means  that  the  muititarget  tracking  problem  is  solved 
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at  tiie  sensor  (node)  level  using  the  appropriate  local  estimation  algorithm. 
When  the  muititarget  tracking  problem  is  solved  separately  for  each 
individual  node,  a  somewhat  redundant  view  of  the  surveillance  area  will 
result,  depending  on  the  degree  of  overlapping  coverages  between  the 
radars.  The  output  of  the  local  estimator  is  a  group  of  different  tracks.  A 
track,  is  meant  to  be  a  state  trajectory  estimated  from  a  set  of  sensor 
measurements  that  have  been  associated  with  the  same  target  (Ref.  IS  I.  By 
assuming  that  the  multitarget  tracking  problem  is  solved  separately  for  each 
individual  sensor  (node),  the  track  estimates  for  targets  in  the  surveillance 
area  become  consolidated  first  at  the  level  of  each  individual  node.  The 
information  fusion  process  decides  whether  more  than  one  track  from 
different  nodes  represent  the  same  target,  and  combines  the  corresponding 
consistent  tracks.  The  techniques  for  information  fusion  can  be  based  on 
algorithmic  processing  of  target  kinematic  information  and  heuristic 
reasoning  for  attribute  information  by  using  expert  systems  to  encapsulate 
target  identity,  behaviour,  intent,  tactical  appreciation  and  human  expertise 
in  computer  software. 

B.  DISCRETIZATION  OF  A  SYSTEM  DYNAMICS  MODEL 

The  mathematical  model  of  a  target  motion,  as  state  equation,  is  derived 
when  assuming  that  the  target  normally  moves  at  constant  velocity,  and 
turns  or  evasive  maneuvers  may  be  considered  as  perturbations  upon  the 
straight  lines.  Therefore  acceleration  is  a  driving  input  for  the  state 
equation  which  is  usually  linear.  A  simple  way  to  model  the  unpredictable 
behaviour  of  acceleration  is  to  consider  a  non-Gaussian  stochastic  stationary 
process  (symmetric  with  zero  mean  and  proper  standard  deviation)  with  a 
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correlation  depending  on  the  time  duration  of  manoeuvre  (Ref.  18).  When 
an  aircraft  flies,  it  normally  maintains  a  constant  velocity,  constant  heading 
trajectory.  Consequently,  the  system  dynamics  model  is  given  by  the 
following  continuous  time  state  equation 

X  (t)  *  A  x(t)  ♦  6  w(t)  (4.4) 

Where  w(t)  is  the  random  forcing  function  (  continuous  time  vhiie  process 
noise).  w(t)  ~(  0,Q  ).  We  wish  to  put  the  continuous  equation  (4.4)  into  the 
discrete  form 

x  (k*l)  -  ♦x(k)  ♦  w(k)  (4.5) 

Where  w(t)  is  the  contribution  from  the  random  forcing  function.  The 
general  solution  of  (4.4)  for  x(t)  given  the  value  x(tQ)  at  initial  time  tQ  is 

t. 

x(t)  x(t0)  *  /♦(t,T)G(T)  W(T)  dr  (4.6) 

Defining  T  to  be  the  sampling  interval,  to  describe  the  state  propagation 
between  states,  let  t  *  (k*  I  )T  ,  tQ  a  kT  for  an  integer  k  and  assuming 

stationarity.  so  that: 

♦  (t,t0)«  ♦(t-t0)  *  *(T)  (4.7) 

Defining  the  sampled  state  function  as  x(k)  -  x(kT  )  we  can  write 

M)T 

x(k+l)  »  ♦  (  T  )  x(k)  ♦  /♦  ((k*  1 )  T-r)  G  w(r)dT  (4.8) 

kT 

The  second  term  in  the  right  hand  side  is  a  low  pass  filtered  version  of  the 
continuous  white  process  noise  w(t)  weighted  by  the  state  transition  matrix 
and  the  noise  input  matrix  G. 
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On  changing  variable  twice  (  £  =  t-  kT  and  then  Also,  t  =  T  -  £),  the  limits 
of  integration  can  be  set  to  0  and  T,  so  that,  as  a  first  step  in  the  discrete 
formulation  of  (A A),  we  obtain 

T 

x(k*l)  *  *(  T  )  x(k)  ♦  /♦(t)  G  w(t)  dr  (-4.9) 

0 

Again  under  the  assumption  of  stationary,  the  transition  matrix  can  be 
expressed  in  terms  of  the  matrix  exponential.  Then,  an  expansion  can  be 
performed  to  give  an  approximate  solution: 

♦  (  T  l-e*1  *  I  ♦  AT  ♦  (  AT  )2 12  + . ♦(  AT  )Vn!  (4.10) 

Where  I  is  the  identity  matrix.  If  terms  of  order  T2  and  higher  are 
disregarded,  then  the  result  is  Euler  s  approximation  to  the  sampled  system. 
Equation  (4.9)  becomes 

♦  (!)•  I  ♦  AT  (4.11) 

and  the  discrete  ■  time  process  noise  w(k)  relates  to  the  continuous  time  one 
as  follows 

T 

w(k)«  J+(T)Gw(T)dT  (4.12) 

0 

To  find  the  covariance  Q  (k)  of  the  new  noise  sequence  w(k)  in  terms  of  0. 
we  write 

Q  (k)  -  E  (  w(k)  wT(k)  1 

M)T 

0  (k)  *  J* J*$((k*  1  )T-t)  G  E  Iw(t)  wT(a)J  GT*T((k*l  )T-a)  di  do  (4.13) 

kT 
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(4.14) 


But  E  lw( t)  wT  (a)l  -  Q  6(  T-  a) 

So 

(k*l)T 

Q  (k)  »  /♦((Ic1)T-t)GQGt  ♦t  ((k*  1  )T-  t)  dr 

kT 

by  changing  variables  twice  as  before 

T 

0  (k)  ■  J*(t)GQGt  ♦t(t)  dr  (4.15) 

o 

It  is  worth  noting  that  even  if  Q  is  diagonal,  Q  (k)  need  not  be.  Sampling  can 
destroy  independence  among  the  components  of  the  process  noise.  Using 
(4.10)  then 

Q  (k)  -  G  Q  GT  T  ♦  (A  G  Q  GT ♦  G  Q  GTAT  )T2  /  21  ♦ .  (4.16) 

In  Euler  s  approximation  the  process  noise  covariance  Q  (k)  results  from 
multiplication  by  T 

Q  (k)  -  e  0  gtt 

Figure  4.2.  shows  the  sequence  of  operations  used  to  evolve  the  discretized 
equation  of  an  aircraft. 
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System  State  Equation 


x(t)  *  A  x(t)  ♦  G  w(t) 


FIGURE  4.2.  Evolving  of  the  Discretized  System 
Dynamic  Model 
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C  MMT  MATHEMATICAL  MODEL 

Suppose  that  s  targets  are  present,  and  the  discretized  equations  of 
motion  for  each  target  being  tracked  is  adequately  modeled  by  a  separate 


stochastic  difference  equations  of  the  form 

x'(k*  l )  -  ♦  ((<•*■  l  ,k)  x'(k)  ♦  w  (k)  k  ■0,1,2, . ,n  (-4.17) 

i  *  1 ,2,3 . .3 


The  superscript  indices  i  denote  the  various  targets.  x'(k)  is  the  nxi  state 

vector  (which  generally  includes  at  least  position  and  velocity  coordinates)  of 
the  tracked  target  at  the  k&  sample  time.  ♦'  (...)  is  the  (nxn)  state  transition 
matriz,  and  w'  (k)  is  a  pi  xl  state  ezcitation  vector,  which  is  usually 
constructed  in  aircraft  tracking  applications  to  account  for  both  maneuvers 
and  modeling  errors,  and  is  generally  assumed  to  be  white  and  gaussian, 
with  zero  mean  and  covariance  Q'(k).  In  a  UPS  system,  the  k^  sample  will 
occur  approximately  at  time  kTJ ,  where  TJ  is  the  scan  time  of  the  j1*1  sensor 
The  corresponding  discrete  measurement  vector  from  each  node  is  given  by 

zJ(k)  -  hJ(x'(k).k)  ♦  vJ(k)  j  .1,2 . N  (4.18) 

Where  the  superscript  j  indicates  the  various  nodes.  zJ (k )  is  a  qJ  xi 
dimensional  measurement  vector  at  node  j  at  stage  k.  hJ(.J  is  the 
observation  equation  for  sensor.  vJ(k)  is  assumed  to  be  white  and 
Gaussian,  with  zero  mean  and  covariance  RJ(k).  It  is  further  assumed  that 
w'(k)  and  vJ(k)  are  not  serially  or  cross-dependent,  in  particular 

S(  w'(A)  vJ(2)  j  -0  for  all  k  and  L  (4.19) 

In  the  s-targetcase,  each  individual  measurement  zJ(k)  of  an  actual  target 
at  time  k  is  drawn  from  a  mixture  probability  density  of  the  form 
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(4.20) 


p(zj(k)  I  x1(k).x2(k) . x9(k))  -  2p(  P(2J(k)  { x'(k» 

1=1 

with  unknown  priors  denoted  p.. 

Denoting  the  number  of  measurements  at  time  k  by  m(k),  a  set  of  ^m(k) 

(  k=1 

measurements  is 

m(k)  n 

ZJ  *  (  (  zJ  (k) }  }  J  -  1,2 . ,N  (4.21) 

l  1=1  k=1 

Where  ZJ  is  the  cumulative  set  of  measurements  at  each  node. 

It  is  the  essence  of  multitarget  tracking  problems  that  there  is  a  large 
amount  of  overlap  among  the  component  densities  of  (4.20),  so  that  the 
target-to- measurement  correspondece  x'  (k)«-*  zJ (k)  is  difficult  to  obtain. 
The  set  of  all  measurements  ZJ  can  be  partitioned  into  m  tracks 

X'C  T 

V  ■  x1  U  X2  U  X3  U . uxm 

where  X'  O  AJ  *  0  for  l^j 

One  of  the  tracks  (e.g  Am )  contains  all  of  the  false  alarms.  With  this  notation, 
any  hypothesis  concerning  measurements  made  by  the  surveillance  system  ( 
the  number  of  targets  present,  which  data  points  belong  to  which  target)  can 
be  defined  as  a  family  of  subset 

AeCZJ 

The  defining  characteristic  of  multitarget  tracking  problem  is  that  a  number 
of  partitions  can  be  found,  each  one  composed  of  tracks  X*  that  appear 

feasible  from  the  standpoint  of  the  likelihood  tests. 
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D.  THE  HORIZONTAL  ESTIMATOR 

Figure  4.3.  shows  the  processing  of  target  data  using  the  Horizontal 
Estimator  (HE)  at  each  node.  By  using  tracking  filters  such  as  those  discussed 
in  references  (Ref.  IS.  19,  20,  211,  produces  local  track  data  bases  consisting 
of  the  target  state  vectors  and  estimation  error  covariances  at  specific  points 
of  time.  Typically,  the  local  track  data  base  will  contain  kinematic 
information  and  sometimes  attribute  information  is  included.  The  typical 
kinematic  information  would  be  : 

x* J  (k|k )  -  The  target  local  track  estimate  at  time  k  using  the  local  sensor 
data  of  node  j.  i.e 

x^-BIx'lZ'l 

p*  J(k|k)  ~  The  local  error  covariance  matrix  associated  with  the  target 
local  estimate  x'  J(k|k),  i.e 

pU-E((  x'-fi1  XxWlZ'l 

Thus.  (x,J.  p' J )  records  are  created  by  processing  the  raw  sensor  for  each 
node  with  an  appropriate  filter  (local  estimator)  to  produce  the  local  track 
data  base. 

Before  fusing  any  two  tracks  from  two  different  nodes,  the  two  tracks  are 
referred  to  the  same  coordinate  system  using  the  required  transformation, 
and  to  the  common  time  instant  using  the  prediction  equations  of  the  Kalman 
filter  (Ref.  221. 

5,J(tltu)-V(tltu)  x'^tjjltjj)  ' ~ 1,2  14.22) 

J  -  1,2 

and  (tit,,)  piJ  U(Jlltj)  '  =  1,2  (4.23) 

J  -  1,2 
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Where  t  is  the  common  time  iostant 

t  ( j  is  the  time  of  the  last  updating  for  the  ith  track  of  the  jth  sensor 

is  the  transition  matrix  of  the  target  model 
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1 

i 


1GURE  4.3.  Processing  ot  Radar  Data  at  Each  Node  m  HEA 
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V.  LOCAL  ESTIMATION 


A.  LOCAL  ESTIMATOR  FUNCTIONS 

Generally,  the  determination  of  position  and  velocity  of  a  target  using 
radar  measurements  such  as  range,  bearing  and  range  rate,  is  a  problem  of 
nonlinear  estimation.  In  many  cases,  the  relationship  between  the 
measured  data  (e.g,  range,  azimuth,  doppter  velocity  )  and  the  target 
dynamic  parameters  is  nonlinear.  A  rigorous  treatment  of  the  nonlinear 
estimation  problem  requires  the  use  of  stochastic  integrals  and  stochastic 
differential  equations.  The  optimal  (  conditional  mean  )  nonlinear  estimator 
cannot  be  realized  with  a  finite-dimensional  implementation,  and 
consequently,  all  practical  nonlinear  filters  must  be  suboptimal  (Ref.  191. 

As  shown  in  Figure  51,  the  local  estimator  performs  the  functions  of 
track  initiation,  plot- track  correlation,  track  prediction,  track  filtering,  and 
track  termination.  An  overwhelming  number  of  approaches  to  filtering  and 
prediction  for  multitarget  tracking  have  been  developed  recently  in  response 
to  the  ever-increasing  importance  of  the  subject  However,  at  this  stage  of 
development  no  standard  approaches  are  generally  accepted  for  all 
applications.  A  wide  variety  of  techniques  have  been  proposed  for  many 
diverse  applications,  but  the  multitarget  system  designer  must  choose  the 
techniques  best  suited  to  his  particular  problem. 
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PREDICTED  TRACK 


FIGURE  3. 1  Basic  Functions  of  Local  Estimator 


There  is  a  comprehensive  set  of  papers  that  iiiustrate  commonly  known 
techniques  for  solving  the  multitarget  tracking  problem.  Among  them  is  the 
paper  by  Reid  (Ref.  231.  the  survey  paper  by  Bar-shalom  (Ref.  201.  and  the 
paper  by  Chang  (Ref.  191.  There  are  two  most  commonly  used  conventional 
approaches  to  filtering  and  prediction  for  muititarget  tracking  (Ref.  181.  The 
first  is  to  use  fixed  tracking  coefficients,  like  the  a-p  tracker  and  the  a-p-y 
tracker  (Ref.  18.  24,  23.  261  which  has  computational  advantages  lor 
systems  with  large  numbers  of  targets.  The  second  is  Kalman  filtering 
which  generates  time-variable  tracking  coefficients  that  are  determined  by  a 
priori  models  for  the  statistics  of  measurement  noise  and  target  dynamics 
So.  a  Kalman  filter  (KFHs  a  computational  algorithm  that  processes 
measurements  to  deduce  a  minimum  variance,  unbiased  error  estimate  of 
the  state  of  a  system  by  using  the  system  and  measurement  dynamics, 
assumed  statistics  of  system  noises  and  measurement  errors,  and  known 
initial  condition  information. 

B.  CONVENTIONAL  KALMAN  FILTERING 

With  expanding  computer  capabilities,  the  Kalman  filtering  is  becoming 
increasingly  more  appealing  to  the  system  designer.  The  Kalman  gain 
sequence  is  chosen  automatically,  based  on  the  assumed  target  maneuver 
and  measurement  noise  models,  which  means  that  the  same  filter  can  be 
used  for  varying  targets  and  measurement  environments,  by  changing  a  few 
key  parameters.  Also,  the  Kalman  filter  provides  a  convenient  measure  of 
the  estimation  accuracy  through  the  covariance  matrix.  This  measure  is 
required  to  perform  the  gating  and  correlat:on  function  accurately.  In 
addition,  having  a  measure  of  the  innovation  sequence,  is  useful  for 
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maneuver  detection,  and  upon  maneuver  detection  the  Kalman  filter  model 
provides  a  convenient  way  to  adjust  for  varying  target  dynamics.  The  five 
equations  for  the  conventional  Kalman  filter  are: 


x(klk- 1 )  ■  ♦  x(k- 1  Ik- 1 )  *  w  state  extrapolation 

p(k|k- 1 )  *  $  p(k-  i  Ik- 1 )  ♦  Q  error  covariance  extrapolation 

K(k) 3  ptklk-1 )  H1  (H  p(k|k-  i )  HT  ♦  R)'1  Kalman  gain 

p(kik)  -  (I  -  KH  )  p(k|k- ! )  error  covariance  update 

x(k|k)  -  xiklk- i )  *  K  (z(k)  -  H  x(klk- 1  ))  state  update 

x(klk- 1 )  time  updated  state  vector 

x(k!k)  measurement  updated  state  vector 

plklk- 1 )  time  updated  error  covariance 

p(klk)  measurement  updated  error  covariance 

$  state  transition  matrix 

K  Kalman  gain  matrix 

I  identity  matrix 

H  measurement  transformation  matrix 

R  measurement  error  covariance  matrix 

w  process  noise 

Q  process  noise  covariance  matrix 

y(k)  *  (z(k)  -  H  x(klk- 1 ))  measurement  residual 
Wk)  3  H  piklk- 1 )  HT  ♦  R  measurement  residual  covariance 


(5.1) 

(5.2) 
(5-3) 

(5.4) 

(5.5) 


A 

Once  the  initial  state  x(0|0)  and  the  initial  error  covariance  matrii  p(OIO) 
are  established,  the  Kalman  equations  can  be  activated.  Figure  5-2.  shows  a 
simplified  scheme  of  Kalman  filter  operation. 
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When  Kalman  introduced  these  equations  ((3.1 )  -  (3  5))  27  years  ago  [Ref. 
291,  they  ofTered  the  engineering  community  a  means  to  model  discrete  - 
time  systems  via  the  state-space  modeling  method  for  multivariable 
systems.  Unfortunately,  when  the  state  equation  was  written  for  the 
tracking  applications.  Kalman's  conventional  equations  were  found  to  fall 
short  in  two  respects: 

a.  Finite  wordlength  computation. 

b.  Changing  system  dynamics  model  in  real  time. 


Measurement 


Optimal 

State 

Estimate 


Figure  3.2.  Simplified  Scheme  Of  Kalman  Filter 
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C  THE  EXTENDED  KALMAN  FILTER 

For  nonlinear  modeling  problems,  the  Extended  Kalman  Filter  (EKF)  is 
used  to  extend  the  linearized  Kalman  filter  design  by  relinearizing  about 
each  estimate  ( i.e.,  Kalman  equation  (5. 1 ))  once  it  has  been  computed.  The 
sucess  of  the  method  of  linearization  about  a  nominal  trajectory  in  state 
space  depends  upon  the  accuracy  of  the  nominal  trajectory.  This  technique 
has  little  hope  of  success  in  a  situation  where  there  is  almost  no  prior 
information  aoout  the  trajectory  as  in  the  situation  of  the  target  acquisition 
problem.  Here,  a  psuedo  a  priori  may  be  generated  from  the  incoming 
observations.  In  EKF,  as  soon  as  a  new  state  estimate  is  made,  a  new  and 
better  reference  state  trajectory  is  incorporated  into  the  estimation  process. 
In  this  manner,  one  enhances  the  validity  of  the  assumption  that  deviations 
from  the  reference  (nominal)  trajectory  are  small  enough  to  allow  linear 
perturbation  techriques  to  be  employed  with  adequate  results  (Ref.  27). 
Radar  measurements  ( range,  azimuth,  elevation,  and  possibly  range  rate) 
are  in  polar  (spherical)  coordinates.  Thus  there  exist  a  known,  nonlinear 
relationship  or  transformation  between  the  state  of  the  system  and 
observations. 
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Of  The  Earth 

ite  Frame  for  TWS  System 


3 


Assuming  NED  coordinate  frame  shown  in  Figure  5  3,  where  X.Y.and  2  are 
North,  Bast,  and  Down,  respectively.  The  NED  coordinate  frame  is  chosen 
because  it  is  applicable  for  surface  (ground  or  shipbased)  tracking  systems, 
in  addition  that  it  is  particularly  useful  for  airborne  systems.  As  shown  in 
Figure  5  3.  the  origin  of  an  aircraft  tracking  system  is  the  TWS  system  site. 

It  is  worth  to  note  that  the  NED  system  is  not  strictly  an  inertial  system  for  a 
moving  platform  because  the  platform  axes  are  slowly  changing  their 
orientation  in  space  as  the  vehicle  moves  over  the  earth  s  surface.  However, 
except  near  the  North  pole  the  effects  of  the  rotations  are  negligible,  and  the 
NED  system  is  essentially  inertial  for  aircraft  platform.  For  the  coordinate 
system  shown  in  Figure  5  3.  we  will  have  the  following  relationships 
between  spherical  (polar)  and  cartisian  coordinates 


Range  r 3 (x2  ♦  y2  ♦  z2  )05  (5.6) 

Azimuth  <p 3  arctan  (y/x)  (5-7) 

Elevation  3  -  arcsin  (-z/r)  (5.8) 

Range  rate  r  3  (xx  ♦  yy  ♦  zz)/  r  (5.9) 

Denoting  that 

x9ax(klk-l)  .  x,ax(klk-l) 

ya*y(klk-!)  ,  y9ay(klk-i)  (5.10) 

A  A 

z9az(k|k-l)  .  z9*z(klk-!) 

r9  *  (x^y^z*2)0-5 


The  corresponding  measurement  transformation  matrix  H.  is  given  by 
IRef.  281 
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(5.11) 


H  - 


* 

4M®aar®nwnts 


*n 

0 

*13 

0 

*6 

0  )  Rang* 

*21 

0 

*23 

0 

0 

0  )  Azimuth 

*31 

0 

*33 

0 

*35 

0  )EI«vaUon 

*41 

*42 

*43 

*44 

*45 

*46  )  Rang®  Rat® 

6 

Status 

Where 

*n  *  x«  /  r# 
hc  -y®/r, 

hg  *  2g  / 

*21  “-y®/(x®2*y®2) 

h23  -  x*/(x*2  +  y*2) 

h3i  -  z,  x#  /((r*2)^2  ♦  y#2  )os ) 

*33  *  y«  z®  /((r*2)^2  ♦  y«2  )os ) 

*35  *  (“  (x®2  +  y®2)0,5)  /(r92) 

* 4i  *  (xe/rJ-((x9(x*x#*y®y®*Z#Z,)) /r*5) 
h42  «  x,/rt 

*43  *  (y®/ r«)  *((y«(x*Xg+  y9y9*  z9z9))  /r#3) 
*44  *  y#/ 

*45  *  (z,/r®) ’((z«(x*x,*  y®y®*  z*z,)) /r#3) 

*46  *  2® !  **• 


(5.12) 


D.  UD  COVARIANCE  FACTORIZED  KALMAN  FILTER 

The  most  troublesome  numerical  aspect  of  the  Kalman  filter  is  the 
measurement  update  of  the  error  covariance  matrix,  so  the  properties  of 
alternate  computational  forms  are  of  substantial  interest.  Potter  (Ref.  30) 
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introduced  a  square-root  approach  for  propagating  the  error  covariance 
matrix  in  the  absence  of  process  noise.  This  method  is  completely  successful 
in  maintaining  the  positive  semidefinite  nature  of  the  error  covariance,  and 
it  effectively  reduced  the  precision  requirements  to  about  half  the  number 
of  bits  needed  for  the  full  covariance  update.  The  outstanding  numerical 
characteristics  and  relative  simplicity  of  this  Potter  square-root  approach  led 
to  its  implementation  in  the  Apollo  navigation  filters  (Ref.  3  ll.  The 
numerical  characteristics  of  the  filter  have  been  further  improved  upon  by 
Agee  and  Turner  (Ref.  321.  Carlson  (Ref.  331  and  Bierman  (Ref.  34 1,  among 
others.  The  benefits  erf  square-root  filters  were  attractive  in  the  sense  of 
relieving  the  numerical  problems  and  maintaining  symmetry  of  the 
covariance  matrix.  Bierman  s  method  requires  the  fewest  arithmetic 
operations  of  the  square-root  formulations.  Bierman  s  UD  filter  factorizes  the 
covariance  matrix  P  in  the  filter  as  follows 

P-UDUr  (5.13) 

where  U  is  an  upper  triangular  matrix  with  1  s  along  its  main  diagonal  and  D 
is  a  diagonal  matrix.  The  UD  algorithm  requires  a  diagonal  measurement 
covariance  matrix  R,  which  implies  that  the  measurement  errors  are 
uncorrelated.  If  the  matrix  is  not  diagonal,  it  can  be  made  diagonal  as 
indicated  by  Yannone  (Ref.  33).  Bierman  derived  a  recursive  algorithm  for 
implementing  a  (Caiman  filter  in  terms  of  U  and  D,  rather  than  p  (Ref.  34. 

361. 
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E.  PARALLEL  KALMAN  FILTERING 

In  general,  real-time  filtering  cannot  be  performed  on  large-scale 
problems  using  a  uniprocessor  architecture  because  serious  processing  lags 
can  result.  The  Kalman  filter  can  be  extended  to  a  much  greater  class  of 
problems  by  using  parallel  processing  concepts.  Full  utilization  of 
parallelism  can  be  obtained  through  insight  in  the  structure  of  the  problem 
and  decoupling  of  arithmetic  processes  to  permit  concurrent  processing.  One 
must  simultaneously  develop  the  parallel  algorithms  for  solving  the  filtering 
problem,  and  the  associated  processor  architectures  to  achieve  the  maximum 
benefits  from  parallelism.  Parallel  Kalman  filter  architectures  based  on  this 
design  methodology  can  be  implemented  with  VLSI/VHSIC  technology  (Ref. 
371.  VLSI  technology  allows  the  designer  to  map  system  level  architectures 
directly  into  hardware.  To  date,  relatively  little  research  has  been  conducted 
on  restructuring  the  Kalman  filter  for  parallel  processing.  Three  approaches 
that  have  been  considered  include  (Ref.  371: 

a.  Vectorizing  the  standard  Kalman  filtering  equations  by  running  the  filter 
on  a  vector  (or  array  )  processor  (Ref.  381. 

b.  Uncorreiating  the  measurement  data  to  the  filter  so  that  each 
measurement  can  be  pipelined  into  each  processor  simultaneously 
(Ref.  391. 

c.  Decoupling  the  predictor  and  corrector  equations  in  the  filter  so  that 
these  computations  can  be  evaluated  simultaneously  on  separate 
processors  using  multiprocessing  (Ref.  36,  401. 

Although  the  first  approach  can  speed  up  computations  considerably  over 
conventional  techniques  ( speed-up  factors  of  6  to  1 0  have  been  realised 
(Ref.  33]),  the  computational  throughput  was  limited  primarily  by  the 
architecture  of  the  array  processor.  This  occurred  because  the  array 
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processor  architecture  was  optimized  for  Fast  Fourier  Transform  (FFT) 
computations,  not  (Caiman  filter  computations. 

An  approach  based  upon  mapping  the  (Caiman  filter  equations  directly 
onto  a  highly  parailel/pipelined  architecture  can  be  used.  Thus,  the  parallel 
(Caiman  filters  algorithms/architectures  developed  in  [Ref.  361  exploit  the 
structure  of  the  filter  to  improve  throughput  using  pipelining  and 
multiprocessing.  This  approach  to  speed  up  (Caiman  filter  computations  is  to 
perform  parallel  processing  at  three  major  levels: 

1.  The  measurement  data  to  the  filter  is  uncorrelated  so  that  each 
measurement  can  be  processed  simultaneously. 

2.  The  predictor  and  corrector  equations  of  the  (Caiman  filter  are  decoupled 
so  that  the  predictor  and  corrector  can  be  computed  on  separate 
processors. 

3.  The  measurement  data  are  pipelined  into  each  processor. 

Thus,  multiprocessing  and  pipelining  are  combined  to  achieve  large 
improvements  in  computational  speed.  Each  processor  architecture  can  be 
implemented  with  VLSI  technology.  To  facilitate  parallel  processing  and 
pipelining,  the  measurement  data  to  the  filter  should  be  uncorreiated.  The 
data  can  be  uncorrelated  by  diagonalizing  the  covariance  matrix  associated 
with  the  measurement  noise  in  the  filter.  Procedures  for  uncorreiating  the 
data  generally  utilize  coordinate  transformations  based  upon  eigenvalue  or 
singular  value  decomposition  (Ref.  41].  If  the  eigenvalues  of  R  are  not 
distinct,  the  generalized  eigenvectors  of  R  must  be  determined  (Ref.  411. 
Similarly,  if  the  eigenvalues  of  R  are  complex,  R  can  be  transformed  to  a 
block  diagonal  matrix  (Ref.  4 1  ]. 
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F.  ROBUSTNESS  OF  THE  KALMAN  FILTER 

An  important  part  of  the  design  of  the  Kalman  filter  is  to  regulate  the 
robustness  of  the  filter  to  handle  deviations  from  the  prescribed  systems 
dynamics  model  in  the  cases  when  the  target  makes  heading  changes. 
Without  some  form  of  re- modeling  adaptively  in  real-time,  the  filter 
diverges.  When  the  conventional  filter  is  emulated  for  constant 
velocity /constant  heading  targets,  filter  divergence  results  after  a  number  of 
iterations,  and  the  error  covariance  matrix  goes  negative  definite.  This  lack 
of  reliability  forces  the  seeking  of  a  more  stable  algorithm  for  implementing 
the  Kalman  filter.  The  estimator  eigenvalues  control  (Ref.  28],  can  be  used. 
In  a  Kalman  filter,  this  amounts  to  the  eigenvalues  of  (♦  -  KH).  The  control 
for  any  given  (<fr .  H)  pair,  is  a  function  of  K  .  K  is  a  function  of  p(k*  1  Ik) ,  and 
p(k+ 1  Ik)  is  a  function  ofO. 

Whatever  method  is  adopted  for  track  filtering,  it  is  usually  necessary  to 
combine  it  with  some  form  of  adaptation.  An  adaptive  system  is  one  which 
continually  adjusts  its  own  parameters  in  the  course  of  time  to  meet  a 
certain  performance  criterion.  On-line  adaptation  is  required  when 
significant  changes  occur  in  the  target  motion  (maneuvers),  measurement 
accuracy  or  frequency  of  detection.  The  excitation  noise  covariance  Q  is  a 
statistical  quantity  used  to  cover  uncertainties  in  the  model  of  target  motion 
described  by  (4.5).  Measurement  accuracy  is  described  statistically  by  the 
noise  covariance  R.  and  the  interval  between  filter  updates  is  given  by  the 
time  T  (  which  appears  in  <P,Q(k)).  Changes  in  the  tracking  environment 
must  be  reflected  in  the  appropriate  adjustment  of  these  three  filter 
parameters  during  the  track  estimation  process.  Adaptive  tracking  requires 
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the  on-line  computation  of  a  figure  of  merit ,  or  track  performance  indicator, 
whicn  typically  involves  a  weighted  combination  of  terms  in  the  residual 
(the  innovation  sequence)  tXk).  It  also  requires  a  practical  procedure  for 
determining  what  quantitative  adjustments  should  be  made  in  the  filter 
parameters.  Two  approaches  can  be  used  to  control  the  filter  performance: 

a.  The  system  s  eigenvalues  are  fixed  and  kept  constant.  By  holding  the 
eigenvalues  constant,  a  prescribed  degree  of  stability  is  maintained. 

b.  A  thresholding  technique  is  mechanized  to  adaptively  model 
maneuvering/nonmaneuvering  targets. 

The  first  method,  which  computes  the  eigenvalues  as  function  of  the  Q- 
matrix  with  constraints  based  upon  the  damping  sought  (critical, 
overdamped),  starts  with  Kalman  equations  (5.2),  (5  3).  and  (5  4): 

K  *  function  of  (p(k*  I  Ik),  R,  H) 
p(k*  Ilk)  *  function  of  (p(klk).  Q,  ♦) 

Solving  generally,  for  p(k*  Ilk)  and  substituting  into  the  equation  for  K  leads 
to  the  stability: 

(♦  -  KH)  ■  function  of  (p(kik),  Q,  ♦,  R.  H) 

The  eigenvalues  of  (♦  -  KH)  can  be  sought,  to  yield  a  characteristic 
polynomial 

<?U)-  f  *aiSn~1  +a2^~2  * . +  Vi*+an  (5.14) 

For  n  -  6,  (5.14)  will  be 

q(£)-  S6  ♦  a,  I5  ♦  a2  54  *a3  S3*  a4  Ij2*  a5 1  ♦  ag  (5.15) 

Values  of  ^  through  can  be  chosen  to  yield  a  prescribed  system  response 

in  the  z-plane.  The  unit  circle  represents  underdamping  (marginally  stable 
system,  poles  on  the  jw-axis  in  the  s- plane).  Within  the  unit  circle 
represents  either  critical  or  overdamped  system  responses.  Selecting  critical 


80 


damping  leads  to  a  polynomial  q(£).  The  coefficients  of  the  polynomial  are 
functions  of  (p(k|k),  Q,  ♦.  R).  from  which  equations  of  all  non-zero  Q-  matrix 
elements  can  be  computed  as  functions  of  p(klk).  ♦,  R,  and  constants.  Thus 
constrained  eigenvalues  of  ($  -  KH)  imply  a  prescribed  system  response 
(critical,  underdamped.  or  overdamped).  They  are  regulated  by  resulting 
expressions  of  variable  Q-  matrix  elements,  which  are,  themselves,  a 
function  of  p(klk),  4>,  and  R. 

The  problem  with  this  is  that  the  optimum  estimates  may  not  always  be 
derivable  from  a  priori,  critically-damped  eigenvalues.  Keeping  the  filter 
bandwidth  open  (high)  all  the  time  is  not  the  optimal  solution  for  the 
muititarget  tracking  problem.  As  a  consequence,  the  other  method  is  more 
adequate. 

The  second  method  is  a  thresholding  mechanism  to  adaptively  open  and 
dose(i.e.,  raise  and  lower)  the  filter  bandwidth  based  upon  information 
about  the  target  s  actual  activity.  Trial  and  error  led  to  a  high '  and  Tow  Q 
based  on  the  range  and  range  rate  residuals.  Emulation  of  the  system  TWS 
estimator  revealed  that  a  system  response  lag  of  one  frame  could  be 
accommodated  by  evaluating  the  residuals  prior  to  evaluating  Kalman 
equation  (5  2).  If  the  threshold  was  broken,  the  Q-  matrix  was  set  to  high 
or  Tow  '  accordingly,  then  equation  (5.2)  was  evaluated  with  this  selscted  Q. 
This  effectively  gave  the  filter  a  sneak  preview '  one  T  ahead  with  which  to 
optimize  the  system  robustness/stabiiity.  The  fact  that  the  system  dynamics 
mode  does  change  is  the  reason  why  the  first  method  is  not  recommended  to 
be  used.  The  Kalman  filter  stability/robustness  control  can  be  summarized  as 
the  following: 
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(5.16) 


-  KH) 

<  1 

*  stable 

$(♦  *  KH) 

*  1 

*  marginally  stable 

*(♦  -  KH) 

>  1 

*  unstable 

a.  Transient  response  varies  with  the  closed  loop  filter  pole  locations. 

b.  The  input  to  the  Kalman  filter  which  gives  an  indication  that  the  target 
is  maneuvering,  are  the  residuals. 

c.  Residuals  regulate  Q,  and  Q  regulates  Kalman  filter  bandwidth, 
robustness,  and  stability. 

When  a  high  Q  is  used  on  a  non- maneuvering  target,  the  filter  is  being 
erroneously  told  that  the  constant  velocity  ;>  .tant  heading  system 
dynamics  model  is  being  violated  when  in  fact  it  isn  t.  and  vice  versa. 
Conseqently.  the  filter  estimates  are  noisier  (less  accurate)  for  non- 
maneuvering/'high  Q  than  necessary,  and  conversely,  the  filler  diverges 
(breaks  track)  for  the  maneuvering/"low'  Q  case.  In  addition  to  the 
adaptive  thresholding  techniques,  the  state  vector  estimate  accuracy  was 
improved  by  performing  a  relinearization  about  the  best  estimate 
sequentially  for  each  measurement  made.  Figure  5 A.  shows  the  sequence  of 
computations. 
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FIGURE  5.4.  Computation  Sequence  and  Sequential  Relinearization 
About  The  Best  Estimate 


G.  MULTITARGET  LOCAL  ESTIMATORS 

It  was  not  until  the  early  1970  s.  that  multitarget  tracking  theory  became 
a  major  topic  of  interest  (Ref.  IS.  181.  The  papers  by  Singer  and  Stein  (Ref. 
421.  Singer.  Sea  and  Hou»ewright  (Ref.  431.  Jaffer  and  Bar-shalom  (Ref.  441. 
Bar-shalom  and  Tse  (Ref.  451.  began  the  development  of  modern  multitarget 
tracking  techniques  that  combine  correlation  and  Kalman  filtering  theory 
(Ref.  15.  181.  In  a  dense  target  environment,  gating  only  begins  to  solve  the 
problem  of  associating  observations  with  tracks.  Additional  logic  is  required 
when  an  observation  falls  within  the  gates  of  multiple  target  tracks  or  when 
multiple  observations  fall  within  the  gates  of  a  target  track. 

Figure  5.5.  illustrates  a  typical  situation  in  which  both  types  of  conflict 
occur.  This  logic  is  required  to  face  the  ambiguity  about  the  origin  of  the 
observations,  which  originated  from  the  target  of  interest.  The  correlation 
function  takes  the  output  of  the  gating  function  and  cakes  final  observation- 
to-track  assignments.  The  various  measures  proposed  against  the 
correlation  conflict  in  automatic  tracking  can  be  reduced  to  the  two 
substantial  principles . 

1.  Application  of  the  nearest-  neighbor-  rule. 

2.  Application  of  branching  procedures. 
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0 1 , 02, 03  Observation  Positions 
P 1 ,  P2,  P3  Predicted  T arget  Positions 

Figure  3.3-  Example  of  A  Complex  Conflict 
Situation 
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1.  Nearest-Neighbor  Approach 

The  nearest-neighbor -rule  prescribes  correlation  of  that  track -result 
which  is  statistically  nearest  to  the  target  s  predicted  position  (the  search 
plot  in  case  of  track  initiation).  It  is  based  on  the  assumption,  that  the 
return  which  is  nearest  to  the  predicted  position,  is  most  probabiy  the  true 
target  s  return.  The  nearest-neighbor -rule  is  capable  of  solving  satisfactorily 
the  correlation  conflict.  At  smaller  target  distances  it  fails  to  solve  the 
conflict,  because  miscorrelations  generally  produce  deviations  of  the  track 
and  target  trajectory  and  finally  lead  to  target  loss.  The  track  updating 
process  typically  begins  with  a  gating  procedure  that  is  used  to  eliminate 
unlikely  observation -to- track  pairings.  The  simplest  multitarget  system  use 
sequential  data  processing  and  the  nearest- neighbor  association  rule.  For 
example,  this  is  normally  the  approach  used  with  a  TWS  system.  With  this 
approach,  processing  is  done  at  each  scan  using  only  data  received  on  that 
scan  to  update  the  results  of  previous  processing.  The  nearest-neighbor 
assignment  algorithm  assigns  observations  for  existing  tracks  in  a  manner 
that  minimizes  some  overall  distance  criterion.  It  looks  for  a  unique  pairing 
so  that  at  most  one  observation  can  be  used  to  update  a  given  track.  Using 
this  approach,  the  optimal  solution  is  obtained  by  assigning  observations  to 
tracks  in  order  to  minimize  the  total  summed  distance  from  all  observations 
to  the  tracks  to  which  they  are  assigned.  A  computationally  efficient 
suboptimal  solutions,  also  can  be  used  to  illustrate  one  suboptimal  solution. 
To  illustrate  one  suboptimal  solution,  the  example  shown  in  Figure  5.6,  is 
solved  using  the  following  rules: 
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1.  01  is  assigned  to  T 1  because  01  is  the  only  observation  within  the  gates 
of  T1  while  T2  has  other  observations  (02, 03)  within  its  gates. 

2  2 

2.  03  is  assigned  to  T2  because  03  is  closer  than  02  (d23<<l22 ). 

3.  04  can.  without  question,  be  used  to  initiate  a  new  track,  but  new  track 
initiation  using  02  may  be  restricted.  This  restriction  is  based  upon  the 
practical  consideration  that  multiple  observations  within  the  gate  of  a 
single  established  track  are  often  the  result  of  a  failure  in  the 
observation  redundancy-elimination  logic.  Thus,  this  restriction  serves 
to  prevent  initiation  of  eitraneous  tracks. 
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6ate 


04 


Gate 


oi,  o2  o3,  o4  *  Observation  Positions 
pi,p2  -Predicted  Target  Positions 

d  -  Distance  From  p2  to  o2 

d  -  Distance  From  p2  to  o3 


Figure  5.6.  Example  of  Gating  and  correlation  for  Two 
Closely  Spaced  Tracks 


Simple  assignment  techniques,  such  as  the  sequential  nearest-neighbor 
approach,  can  lead  to  miscorrelation  with  poor  tracking  as  a  consequence. 

The  problem  with  choosing  the  nearest  neighbor  is  that,  with  some 
probability,  it  is  not  the  correct  measurement.  Therefore,  it  will  use 
(sometimesj  incorrect  measurements  while  believing  that  they  are  correct. 
This  can  lead  to  the  loss  of  target. 

An  alternative  to  nearest-neighbor  correlation  is  the  ail  neighbor 
approach,  which  incorporates  all  observations  within  the  neighborhood,  as 
defined  by  the  gate  around  the  predicted  target  position.  The  position 
update  is  then  based  on  a  weighted  sum  of  all  observations,  with  the 
weighting  calculated  using  probability  theory  (Ref.  451.  This  procedure  is 
called  Probabilistic  Data  Association  (PDA)  since  it  associates  probabilistically 
all  the  neighbors  to  the  target  of  interest.  Then  this  probabilistic  information 
is  used  in  a  suitably  modified  tracking  filter,  called  PDA  Filter  (PDAF),  That 

accounts  for  the  measurements  origin  uncertainty.  Later  results  (Ref.  21.461 

« 

showed  that  the  PDA  did  not  perform  well  in  the  presence  of  multiple 
targets,  so  a  modified  method  denoted  Joint  Probabilistic  Data  Association 
(JPDA)  was  derived  to  include  the  presence  of  multiple  targets  (Ref.  2 1 1. 

2.  Branching  Procedures 

Branching  procedures  use  all  the  results  within  the  correlation  gate  in 
order  to  form  new  tracks.  So.  at  every  sampling  time  when  there  is  more 
than  one  measurement  in  the  validation  region  (correlation  gate)  the  track  is 
split.  The  likelihood  function  of  each  split  track  is  evaluated  in  order  to 
eliminate  unlikely  tracks.  Tracks  whose  likelihood  is  below  a  given 
threshold  are  disregarded  so  as  to  keep  the  number  of  branches  finite.  The 
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likelihood  of  a  measurement  z(k),  under  a  given  assumption  Q*  for  the  model 
is  obtained  from  z(k)  and  the  predicted  state  x(k|k- 1 )  as 

p  (z(k)l  Q,)  -Cexp  (-1/2  vT(k)W'’(k)u(k)}  (5  17) 

When  c  is  a  normalizing  constant  In  the  track-  splitting  techniques,  it  is 
necessary  to  evaluate  the  likelihood  of  a  whole  track,  Le.  of  a  succession  of 
plots.  The  fundamental  limitation  of  the  maximum  likelihood  techniques  is 
that  no  validation  test  is  made  to  control  the  truth  of  a  given  assumtion. 

This  in  practice  leads  to  a  feedforward  approach  to  adaptivity  by  comparison 
with  the  feedback  concept  underlying  the  Bayesian  approach.  The  branching 
procedures  are  marked  by  an  unavoidable  increase  in  the  number  of 
branch-tracks  in  comparison  to  the  number  of  real  targets.  With  regard  to 
the  application  in  a  real-time  system,  these  methods  are  generally 
computationally  costly,  so  they  are  often  inappropriate  as  true  supplement 
to  the  nearest-neighbor  approach. 

Involving  large  computer  burden  and  extensive  memory  requirements, 
sub-optimal  techniques  are  generally  preferred,  leading  to  simplified 
algorithms  and  requiring  storage  of  less  data.  In  practice,  a  trade-off 

i 

!  between  cost  and  effectiveness  is  needed  to  select  the  most  appropriate 

algorithm  for  each  application.  In  HEA  approach,  the  output  of  the  local 
estimator  is  assumed  to  be  tracks  which  are  formed  and  confirmed  with 
great  confidence  and  low  degree  of  uncetainty. 

» 

f 
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VI.  ALGORITHM  ICJ&ACK  FUSION 


A.  TRACK-TRACK  ASSOCIATION 

Let  x  1 1  be  the  local  state  estimate  of  a  target  i  by  a  sensor  i  and  i  **  be 

A  j  j 

the  local  state  estimate  of  a  target  j  by  a  sensor  j.  Local  estimates  x  and 

A  j  j 

x  could  be  for  the  same  target  or  could  be  for  different  targets.  It  is 

desired  first  to  test  the  hypothesis  that  these  estimates  pertain  to  the  same 
target,  in  which  case  the  two  tracks  will  be  associated  as  having  a  common 
origin.  The  optimal  test  would  require  the  entire  data  base  through  time  k 
and  is  probably  not  practical  In  view  of  this,  the  test  is  carried  out  based 
only  on  the  most  recent  estimates. 

The  decision  as  to  whether  track  T  provided  by  node  T  and  track 
provided  by  node  "j",  should  be  associated  can  be  posed  as  a  test  of 
hypotheses  by  defining  a  distance  between  the  two  tracks,  so 


(6.1) 


di'<i,j).||ii‘-xJJH2 
where  II  x 1 '  -  x  ^  II  is  the  norm  of  [x  1 1  -  x  **  J 
II  x  '  ‘  -  x  * J II  -  (E  (I  x  11  -  x  J  J  ]T  (  x  1 1  -  x  i  J  l))0*5 

-  E0JS((J  ,,-JiVii,,-5 ")) 

-  (trB(liH-5^)TlSH-5JV5 

The  distance  between  two  vectors  in  the  space  is  defined  in  the  usual  way 
from  the  norm  of  a  vector  ,  so 

HxH-x  JJ||2-  EUxM-x  JjlT|xH-xij]}  (6.2) 
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Which  is  assumed  to  have  a  Gaussian  distribution.  The  superscripts  i  and  j 
denotes  the  tracks  i  and  j  and  (i,j)  denotes  the  nodes  i  and  j  respectively. 
The  statistical  test  is 

H| 

r-  ldij(l.j)|TP'1ldij(i,j)J  £  &  (6.3) 

Mo 

Where  hypothesis  H0:  track  i  and  track  j  belong  to  different  targets 
hypothesis  :  track  i  and  track  j  belong  to  the  same  target 
and  a2  can  be  chosen  based  on  that  r  will  have  a  chi-square  distribution 
with  the  number  of  degrees  of  freedom  equal  to  the  number  of  elements  in 
the  state  vector.  The  covariance  matrix  for  the  statistical  distance  P  and  the 
resulting  fused  track  can  be  given  using  the  techniques  mentioned  in 
iRef.  471. 

The  test  to  accept  or  reject  the  hypothesis  that  the  two  tracks  are  from  the 
same  target  is  defined  using  the  similarity  threshold  a 2 
r  5  a2  ,  tracks  are  from  the  same  target 
r  >  a2  ,  tracks  are  from  different  targets 
It  is  noted  that  in  the  H,  hypothesis  E{  d1  J  (i,j) )  -0  .  The  choice  of  a2  will 

be  based  upon  the  chi-square  properties  of  r  with  some  experimentation 
probably  required  for  the  particular  application.  The  value  of  a2  would  also 
probably  be  chosen  as  a  function  of  the  target  density  if  known  (Ref.  181, 
Also  a2  can  be  chosen  to  achieve  a  specified  probability  of  correct  association 

Pc. 
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B.  PROBABILITY  OF  CORRECT  ASSOCIATION 

Evaluation  of  the  correct  association  probability  Pcis  now  considered.  The 
probability  of  false  association  Pp  can  be  found  in  a  similar  way  but  with  a 
cumbersome  calculations,  owing  to  the  non-zero  mean  value  of  d'  *  (i,j) .  The 
probability  Pc  is  equivalent  to  the  probability  that  d‘ j  (Li)  lies  inside  the 
hyperellipsoid  in  the  n-dimensionai  space,  defined  by  equation  (6.3).  It  can 
be  computed  by  resorting  to  an  ortho- normalization  procedure  of  the  matrix 
P.  which  transforms  the  hypereilipsoid  into  an  equivalent  hypersphere. 
Hence. 

a 

Pc  *  1/ (2tt)  ^  /  exp(  -  r2 /2)  p(r)  dr  (6.4) 

o 

Where  p(r)  dr  is  the  symmetric  volume  element  in  the  in  the  n  dimensional 
space.  For  n»  1 ,2,3,4,  this  expression  particularises  to 
For  n»1 

exp(  -  r*/2)  dr  «2e  (a)  (6.5) 

o 

Where  €  (.)  is  the  error  function  defined  as 

a 

€  (a)  *  1/  /in”  J  exp(  -  z2  /2)  dx  (6.6) 

o 

For  n»2 

a 

Pc  *  J  r  exp(  -  r2  /2)  dr  - 1  -  exp(  -  a2/2)  (6.7) 

o 
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For  n*3 


pc-/£T  J  r2  exp(  -  r2  /2)  dr * 2e  (a)  -  V 2/ir a  exp(  -  a2^)  (6.S) 
o 

For  n»4 

a 

Pc  *  0.5  f  i^expi  -  r2/2)  dr -i- (!♦  a2/2)  exp(  -  a2/2)  (6.9) 

o 

As  an  example,  the  values  of  Pc  for  n  *  1 ,2,3  and  4  are  shown  in  the 
tables  6.1,  6.2,  6.3.  and  6.4  respectively.  It  is  noticed  that  from  the  results 
given  in  the  previous  mentioned  tables  that,  for  the  same  similarity 
threshold  a,  the  probability  of  correct  association  decrease  as  the  number  of 
elements  in  the  state  vector  increase.  Generally,  with  a»3,  a  high  probability 
of  correct  association  is  obtained.  The  results  presented  in  tables  6.1,  6.2, 

6  3.  and  6.4  is  produced  by  an  innovative  PC  software  product  called 
TKISolver  (see  Appendix  B).  TK I  Solver  shortcuts  the  problem-solving  process 
by  eLminating  two  steps  of  developing  an  algorithm  and  writing  a  program. 
Instead,  the  user  puts  the  mathematical  model  expressing  the  relationships 
between  the  variables  directly  using  standard  algebraic  notation,  in  other 
words,  the  user  communicates  with  the  computer  at  the  level  of 
relationships  (represented  by  equations)  rather  than  at  the  level  of 
sequential  programs  and  assignment  statements.  In  it.  the  input  and  output 
information  reside  in  a  set  of  eight  sheets  and  three  sub  sheets  that  are 
viewed  on  the  screen.  The  principal  ones  are  the  Rule  Sheet,  used  for 
entering  and  displaying  the  equations  or  rules,  and  the  Variable  Sheet,  used 
for  displaying  the  variable  names.  The  Variable  Sheet  also  serves  for 
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assigning  input  values  and  units  as  veil  as  for  displaying  the  results  of  the 
solution.  As  an  example,  Figures  6.2. 6.5.  and  6 A  shov  the  variable  and  rule 
sheets  of  equations  6,5. 67,  and  6.9  respectively.  Konopask  and  Jayaraman 
view  TKISolver  by  itself  as  an  expert  system  primarily  in  the  area  of 
numerical  problem  solving  (see  Appendix  B). 
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TABLE  6.1.  PROBABILITY  OF  CORRECT  ASSOCIATION  FOR  n-1 


a 

Pc 

0.5 

0.371773290 

1.0 

0.666557870 

1.5 

0.888640141 

2.0 

0.979560553 

2.5 

0.985390484 

30 

0.999999853 

3.5 

1.0 

4.0 

1.0 

4.5 

1.0 

5.0 

1.0 

5.5 

1.0 

6.0 

1.0 

6.5 

1.0 

7.0 

1.0 

7.5 

1.0 

8.0 

1.0 
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TABLE  6.2.  PROBABILITY  OF  CORRECT  ASSOCIATION  FOR  n-2 


a 

Pc 

0.5 

0.117503097 

1.0 

0.393469340 

1.5 

0.675347533 

2.0 

0.864664717 

2.5 

0.956063066 

3.0 

0.988891004 

3.5 

0.997812509 

4.0 

0.999664537 

4.5 

0.999959935 

5-0 

0.999996237 

5.5 

0.999999730 

6.0 

0.999999985 

6.5 

0.999999999 

7.0 

1.0 

7.5 

1.0 

8.0 

1.0 

TABLE  6.3.  PROBABILITY  OF  CORRECT  ASSOCIATION  FOR  n-3 


a  Pc 


0.5 

0.019708344 

1.0 

0.182616987 

1.5 

0.500087808 

2.0 

0.763596943 

2.5 

0.897749084 

3.0 

0.973408794 

3.5 

0.993891228 

4.0 

0.998929359 

4.5 

0.999856146 

5.0 

0.999985142 

5.5 

0.999998815 

6.0 

0.999999927 

6.5 

0.999999996 

7.0 

1.0 

7.5 

1.0 

8.0 

1.0 
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TABLE  6.4.  PROBABILITY  OF  CORRECT  ASSOCIATION  FOR  n-4 


a 

Pc 

0.5 

0.007190985 

1.0 

0.090204010 

15 

0.310113507 

2.0 

0.593994150 

2.5 

0.818760149 

3.0 

0.938900519 

3.5 

0.984414126 

4.0 

0.996980836 

4.5 

0.999554274 

5.0 

0.999949690 

5.5 

0.999995647 

6.0 

0.999999711 

6.5 

0.999999985 

7.0 

0.999999999 

7.5 

1.0 

8.0 

1.0 
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-o-  fernH 
-*•  Pe  forn»2 
♦  forms 

-<>■  P?  fwrM 


a 

FIGURE  6.1.  Probability  of  Correct  Association  Pc  Versus  Similarity 

Threshold  a  For  Different  Dimensional  Space  n  of  The 
State  Vector 
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(7r)  Rule: 


194/ ! 


— 

-  variable  sheet 

St 

Input 

Name 

Output  Unit 

L 

P 

.88864014 

L 

0 

a 

L 

f 

L 

0 

x 

L 

S 

1.1137466 

L 

1.5 

x2 

L 

0 

xl 

L 

0 

c 

-  RULE  SHEET  - 

Comment 


Probability  of  correct  association 
Function  values 

Values  of  independent  variable 
Approximate  value  of  integral 
Upper  limit 
Lower  limit 

Coefficients  in  simpson's  formula 


S  Rule 


*  f-exp(-(xA2)/2) 

*  S-(x2-xl)/(3*10)*dot('c,'f) 

*  P-(2/sqrt(6 . 2932) )*S 


FIGURE  6.2.  Variable  and  Rule  Sheets  For  Equation  (6.5) 
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(li)  Input:  0 


201/! 


-  -  variable  sheet  ■■  — 

St  Input  Name  Output  Unit  Comment 


L  0  a 

P 


(lr)  Rule:  P-l-exp( - <aA2/2) ) 

— .  -■  "  RULE  SHEET 

S  Rule 

e 

*  ?-l-exp(-(aA2/2)) 


FIGURE  6.3.  Variable  and  Rule  Sheets  for  Equation  (6.7) 


201/1 
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(li)  Input: 


200/! 


- - —  .....  VARIABLE  SHEET  

Sc  Input  Name  Output  Unit  Comment 


L  P 

L  0  A 


S  Rule 


RULE  SHEET 


★  P-l- (l+(aA2/2) )*exp( - (aA2/2) ) 


FIGURE  6.4.  Variable  and  Rule  Sheet  for  Equation  (6.9) 
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C  COMPOSITE  ESTIMATE  OF  TWO  TRACKS 

Since  the  tracks  are  from  different  sensors  and  at  the  same  time  instant, 
the  notation  for  sensors  and  time  are  dropped  for  simplicity  of  notation.  If 
the  tracks  are  independent,  the  covariance  matrix  P  for  d1  ^  is  defined  as 

P-P‘‘PJ  (6.10) 

fiar -Shalom  (Ref.  33).  has  pointed  out  that  the  covariance  defined  by  (6.10) 
and  the  resultant  formation  of  r  are  not  strictly  valid  because  of  error 
correlation  between  the  two  sensor  estimates.  This  correlation  occurs,  even 
if  the  measurement  errors  are  independent,  because  of  the  common  error 
source  due  to  the  target  dynamics  that  are  seen  by  both  sensors.  A 
technique  outlined  below,  to  account  for  this  error  correlation  can  be  applied 
to  modify  the  covariance  matrix  P. 

Define  a  cross  covariance  matrix  p1 J  such  that  the  initial  condition  is 


p,J(0l0)-0 

Then,  for  k>  0  values  of  p'J(kl  k)  are  computed  using  the  recursive 
relationship : 

p,J(k|k)«  A*(k)i'k-l)|  AJ(k)|T  (6.11) 

Where 

A*(k)  -I  -  K*(k)  h*  (6.12) 

Aj(k)-I-Kj(k)hj  (6.13) 

«lt-l)-  ♦'  p,J(k-llk-l)  (*J)T  +0(k-l)  (6.16) 

The  superscripts  i  and  j  refer  to  sensors  i  and  j,  while  ♦.KJi,  and  Q  are 
defined  for  the  Kalman  filter  (Ref.  48.49).  Finally,  the  modefied  covariance, 
replacing  that  given  by  (6.10)  becomes  (Ref.  47) 

P-P*.  Vi-  pIJ-  lp'J  f  (6.15) 
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For  combining  tracks,  we  begin  by  considering  tbe  fusion  relationships  for  a 


scalar  such  that  a  composite  estimate,  using  estimates 

AC  A  |  A2  A| 

X  ■  X  ♦  C  (  X  -x) 


(6.16) 


Where  c  is  a  weighting  factor  that  will  be  chosen  so  that  the  expected  Mean 

4C  AC 

Squared  Error  (MSB)  on  x  is  minimized.  The  error  in  x  is  defined  as 
Axc  *  Ax’  ♦  c  (  Ax2  -  Ax' ) 

Then,  the  error  variance  on  Axcis  defined  as 

e2  -  E  K  Axc  )2 ) «  o  2  ♦  2c  E  I  Ax1  Ax2 1  -  2c  a  2  *  c2  oA2  (6.17) 


Where 


a2  «  E  ((  Ax’  )2 1  .  a22  -  £  ((  Ax*  )2 ) , 


>A.2 


aA2  -  E  l(  Ax2  -  Ax*  )2]  -  a2*  a22-2  E  [  Ax1  Ax2 1 

The  correlation  between  errors  is  defined  as 
E I  Ax1  Ax2 1- R12 
Equation  (22)  becomes 

e2-  ( 1  -2c  ♦  c2)a2+  c2a22+2  (c  -  c^R12  (6-1*) 

In  order  to  form  the  minimum  MSE  estimates,  we  have 

de?/dc  •  -2  (1-  c  )a2  *2c  a22*2  (l-2c  )  R12  •  0  (6.19) 

Solving  (6.19)  for  c  gives 

c  •  (a,2  -  r’2  )/  ( a  2 *  <J22  -2  R12 )  (6.20) 

In  the  special  case  of  no  correlation 
12 

R  -  0  ,  we  have 

a2  -x’  +  la2/  ( a2*  a22)|(  x2-  x1 ) 
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(6.21) 


■  (o^x1  ♦  o2x2 )/  ( o2*  a22  ) 

A1  A  2 

la  the  case  of  combining  state  estimation  vectors  (  x  ,  x  ) ,  the  same 
general  relationships  given  by  (6.16)  and  (6.17)  are  used,  except  that  the 
variances  become  covariances: 


2RK  -  I  p12  •  (p'2  )T) 


and  c  becomes  a  weighting  matrix: 

,1  12 , .  1  2  12  ,  12  ,T  ,-1 

c-lp-p  11  p  ♦  p  -  p  -  (p  )) 


(6.22) 


Finally,  using  (23)  and  (23),  the  resulting  error  variances  (or  covariances) 


are 

<J2(XC)-  cr2-(o2 .  R12)2/  (  <j  2  *  a22  -2R12)  (6.23) 

12 

In  the  special  case  of  no  correlation  (  R  *  0  ),  we  have 


a2  (xc )  -  o  2  a2/  (  a2*  a2 ) 

So.  generally  in  the  vector  case,  the  resulting  combined  vector,  which 


(6.24) 


minimizes  the  expected  error  is 

xc  »  x1  ♦  c  (  xJ  -  x1 ) 
c  -  (  p1  -  p1  J )  P’1 

and  the  covariance  matrix  associated  with  the  estimate  of  (6.23)  is 

nC  1  /  »  i  ijvT 

P  -  P  - (  P  “  p  )P(p  -  P  ) 


(6.25) 


(6.26) 


As  a  simple  example 

Consider  the  two  estimates,  each  a  2-  dimensional  vector,  with  the  following 
covariance  matrices 
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1 

p 


10 

5 


2  10  -S 

P  “ 

-5  10 

Figure  6.5.  presents  the  1  a  ellipses  corresponding  to  these  matrices, 
namely 

iT(pVx»l  1*1,2  (6.27) 

and  I  a  ellipse  corresponding  to  the  fused  estimate  whose  covariance  is 


P  -  p'  (  p  ♦  p2)*’  p2  (6.28) 

The  reduction  of  the  uncertainty  are  noticed:  the  ellipse  corresponding  to  the 
fused  estimate  is  strictly  smaller  than  the  intersection  of  the  two  ellipses 
prior  to  fusion 

Figure  6.6.  shows  the  ellipses  of  uncertainty  corresponding  to 


I  po 

3" 

p  - 

13 

i°_ 

2  pO 

-6 

P  • 

b6 

10 

and  the  resulting  fused  estimate  according  to  (6.25). 
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FIGURE  6.5.  Error  Ellipsoid  For  Fused  Independent  Tracks 
in  Eiampie  1 


10S 


FIGURE  6.6.  Error  Ellipsoid  For  Fused  Independent  Tracks 
in  Eiample  2 
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D.  OPTIMALITY  OF  HEA  TRACK  FUSION 


Assuming  Uut  there  are  two  sites  (nodes)  at  which  the  same  target  is 
tracked.  At  each  site,  each  local  estimates  x1  J  (k|k),  with  the  corresponding 
p1  ^  (klk)  are  computed.  Hence,  in  this  situation  we  are  concerned  only  about 

one  target,  the  superscript  i  is  dropped  for  the  simplicity  of  notations,  and 
there  will  be  (k|k)  with  its  corresponding  (k|k).  with  j- 1,2. 


Assuming  the  local  measurements  at  each  site  are 

z^k)  -  H^(k)  x(k)  ♦  v^(k)  j  -  1.2 
with  R^k)  the  corresponding  measurement  noise  covariance. 


Denoting 


z'(k) 

z2(k) 


HJ(k)  - 
RJ(k)  - 


H'Oc) 

LH2(k)  J 

I  m 

R'(k)  0 

0  R2(k) 


Using  the  matrix  inversion  lemma 

<  p’1  *  HT R’1  H  )*'  -  p  -  pHT(H  p  HT»R  )-'h  p 


(6.29) 

(6.30) 

(6.31) 

(6.32) 

(6.33) 


which  can  also  be  written  as 

(  p  ♦  H  R  HT  f 1  -  p'1  -  p"1  H  (  H  V  H  ♦  R"1  f  HT  p'1  (6.34) 

the  recursion  for  the  covariance  p(k|k)  can  be  rewritten  as 
p(klk)  - 1  p(ktk- 1  )*' .  HT(k)  R(k)'1  H(k)l ''  (6.35) 

-  p(k|k-l) -  p(k|k- 1 ) Hr(k) « I H(k) p(klk- 1 )Hr(k) ♦R(k)| *'H(k) p(klk- 1 ) 

(6.36) 


Defining 
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(6.37) 

(6.38) 


Mk)  -  H(k)  p(k|k- 1  )HT(k)  *R(k) 

K(k)-p(k|k-1)HT(k)*’(k) 

the  recursion  for  the  covariance  p(k)k)  can  be  rewritten  more  compactly  as 
p(klk)-|  I  -K(k)HT(k)|p(klk-t)l 

-  p(klk-l)-  K(k)*(k)HT(k)  (6.39) 

and  the  following  identity  can  be  written 
p(k|k)  HT(k)  R(k)*1  -  (  p(k|k- 1  )HT(k)  - 

-  p(k|k- 1  )HTfk)(H(k)  p(k|k- 1  )HT(k)  ♦R(k)!  *’H(k)  p(k|k- 1  )HT(k))R(kf 1 

-  p(k|k- 1  )H  r(k)(H(k)  p(k|k- 1  )HT(k)  *R(k)l  ‘1* 

(H(k)  p(k|k- 1  )HT(k)  ♦  R(k)  -  H(k)  p(k|k- 1  jH^kMRfkf1  -  K(k) 

So.  the  Kalman  filter  gain  given  by  (5-3).  can  be  given  by  the  alternate 
expression 

K(k)  -  p(k|k)  HT(k)  R(k)'1  (6.60) 

Which  in  the  case  of  HBA,  for  each  site  (node)  it  will  take  the  form 

K^lk)  -  pW)  H^(k)T  R(k)"1  (6.61) 

and  equation  (5  3)  will  have  the  form 

xJ  (klk)  -  xj  (klk- 1 )  •  K^k)  (  zJ(k)-H‘\k)xJ  (klk- 1 ))  (6.62) 

The  recursion  form  for  the  inverse  of  the  covariance  update  (klk)  is  given 
by  the  equation 

p j  (kik)'1  -  pJ  (klk- 1  f’  ♦  H^k)1  rWh^Ic)  (6.43) 

The  estimate  (klk),  using  expression  (6.41 ).  (6.42)  will  be 
xJ  (klk)  -  xJ  (klk- 1 )  ♦  pW)  H^Ck)T  R^kf*  (  zHu-H^k)#  (klk- 1 ))  (6.44) 

Multiplying  (6.43)  by  (6.44)  yields  for  j-  1 ,2 

pJ  (klk)"'  xJ  (klk)  -  |pJ  (klk- 1 )"'  ♦  hW  R^kfMk)!  xJ  (klk- 1 ) 

♦  pJ  (klk)"1  p^klk)  H^(k)T  R(k)'1  (  z^kl-H^klx^klk-l)) 


ill 


-  pJ  (Klk- 1  r1  xJ  <k|k- 1 )  ♦  hW  R(k)"'  z^tk) 


(6.45) 


Thus 

H^(k)T  R(k)'1  z-Vk)  •  pJ  (klk)'1  xJ  (klk)  -  pJ  (klk- 1 )''  xJ  (k|k- 1 ) 

(6.46  )and  this  viil  be  used  to  eliminate  the  measurements  from  the 
combined 

estimation  update  equation  which  will  be  similar  to  (6.44)  with  the 
superscript  e  instead  of  j,  i.e. 

xc  (klk)  -  xc  (klk- 1 )  ♦  pc(k|k)  Hc(k)T  Rc(k)''  (  zc(k)-Hc(k)xc  (klk- 1 )) 

(6.47) 

Using  (6.30),  (6.31)  and  taking  advantage  of  the  block-diagonal  form  of 
(6.32),  the  combined  state  updating,  given  by  equation  (6.47).  can  be 
rewritten  as 

2 

x* (klk)  -  x®(k|k-l).  p®(k|k) B^k)T R^k)'1  (  zJ(k)-H'\k)x'i <kp- 1 )) 

i 


(6.4*) 

Similarly,  the  combined  covariance  update  is  similar  to  (6.41 )  with  the 
superscript  c  instead  of  j.  Le. 

p*  (klk)'1  -  p®  (klk- 1  )*'  ♦  H*(k)T  R°(k)''H®(k)  (6.49) 

Similar  to  (6.48),  (6.49)  can  be  rewritten  as 

2 

p®  (klk)''  -  p®  (klk- 1 )'' .  R^(k)'Vrtk)  (6.50) 

J=i 


Multiplying  (6.30)  with  (6.48)  yields,  after  cancelations  (similar  to  those  in 
(6.45)) 
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(6.51) 


2 

pe  (kikf1  xc  (klk)  -  pc  (k|k- 1 )' xc  (k|k-  1 )  ♦  £  H^k)T  R(k)'1  z\k) 

j=i 

Finally,  substituting  (6.-46)  into  (6.5 1 ).  one  obtains 

pc  (klk)'1  x'  (klk)  -  p*  (klk- 1  )'’x*  (klk- 1 ) 

2 

♦  £lpJ  (klk)'1  xj  (klk)  -  pi  (klk- 1  f 1  xJ  (klk- 1  )|  (6.52) 

j=i 

Which  is  the  sought-after  expression  of  the  combined  estimate  in  terms  of 
only  the  local  estimates.  So,  the  fused  track  estimates  combine  the  local 
track  estimates  and  the  incoming  track  estimates  without  having  to  calculate 
cross  covariance  given  by  (6.1 1 ).  and  it  can  be  stated  that  the  fused 
estimates  is  the  global  estimates  for  the  tracks  in  the  overlapping  area. 
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VII.  KNOWLEDGE-BASED  TRACK  FUSION 


A.  CAPTURING  THE  EXPERTIZE  OF  A  HUMAN  EXPERT 

la  multitarget  tracking  we  are  concerned  with  the  position  of  targets  and 
their  identity  and  behaviour.  In  fact,  the  position  is  of  over-riding 
importance  because  identity  and  behaviour  mean  little  unless  they  can  be 
associated  with  position  IRef.  9).  Also,  since  we  are  concerned  with  a 
dynamic  environment  we  need  to  take  time  into  account  So.  it  would  appear 
then  that  the  first  task  with  which  we  are  faced  is  how  to  deal  with 
kinematic  information.  In  order  to  combine  track  information  from  any  two 
radar  sensors  in  a  network,  these  are  compared  to  determine  whether  they 
pertain  to  the  same  target  The  decision  process  is  called  correlation.  As  we 
mentioned  before,  we  consider  this  decision  process  implicitly  included  in 
the  fusion  process,  because  there  is  no  meaning  of  the  fusion  without  it  In 
real  life,  radar  sensors  provide  different  types  of  information  with  different 
accuracies.  A  modern  radar  display  includes  alphanumeric  characters  and 
symbols  for  directly  conveying  additional  information.  This  is  useful  when 
target  identity  and  altitude  are  to  be  displayed.  The  target  track  might  be 
shown  as  a  line  on  a  synthetic  display.  The  configuration  of  the  line 
indicates  the  direction  of  the  target  path  while  its  length  can  be  made 
proportional  to  the  target  speed.  This  kind  of  display  has  a  computer  to 
generate  the  graphics  and  control  the  radar  display.  This  permits 
magnification  of  a  selected  area,  stored  flight  plans,  stored  clutter  map  and 
so  on.  The  operator  can  communicate  with  the  computer  in  an  interactive 
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manner  by  means  of  a  keyboard,  track  ball  or  light  pen.  Sometimes,  an 
auxiliary  display  is  mounted  adjacent  to  the  main  display  to  provide  tabular 
data  that  would  otherwise  encumber  the  main  display  (Ref.  151.  Current 
systems  rely  largly  on  manual  correlations  to  form  a  coherent  picture  of  the 
underlying  situation.  An  operator  at  the  radar  display  makes  a  visual 
comparison  of  the  information  provided  by  two  or  more  sensors  and  by 
applying  his  experience  arrives  at  a  decision.  But,  even  with  moderately 
complex  scenarios,  the  workload  can  easily  overwhelm  the  limited  number 
of  operators  available  and  displays.  This  can  lead  to  important  information 
being  overlooked.  A  computer  program  which  could  reliably  carry  out  a 
large  fraction  of  this  task  would  greatly  assist  the  operation  and 
performance  of  the  entire  system.  The  operators  become  proficient  after  a 
long  time  of  practice.  Discussions  with  former  operators  (Ref.  30],  revealed 
that  they  acquired  two  things  as  they  become  experts.  First,  they 
memorized  facts  about  the  platforms  operating  in  their  area,  the  facilities 
available  in  the  countries  of  the  area  and  the  political  alliances  of  those 
countries.  Secondly,  they  learned  how  to  relate  reports  received  to  the 
above  facts  and  the  current  situation  in  a  way  which  would  allow  them  to 
develop  hypotheses  about  future  platform  motions  and  activities. 

Our  initial  approach  to  the  multisensor  information  fusion  was  to  search 
for  suitable  techniques  from  the  world  of  A I  which: 

1.  Were  well  defined. 

2.  Had  been  demonstrated  on  a  similar  type  of  problem. 

Perhaps  the  most  well  known  and  well  defined  part  of  the  A1  scene  is  expert 
systems. 
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Willi  tilt  arrivfti  of  expert  systems .  there  is  a  great  rush  to  encapsulate 
human  expertise  in  computer  software,  and  multisensor  information  fusion 
and  its  related  fields  are  obvious  areas  for  attention.  The  expert  system  can 
be  considered  in  a  wide  sense  as  an  application  of  artificial  intelligence  to 
computer  software.  The  philosophy  of  an  expert  system  is  to  produce  a 
computer  solution  to  a  problem  by  capturing  the  expertize  of  a  human 
expert.  So,  it  can  emulate  the  performance  of  a  human  expert  by 
incorporating  the  analytic  and  heuristic  knowledge  which  the  human  expert 
has.  Of  the  many  expert  systems  that  have  been  developed  in  the  last 
decade,  the  majority  of  the  successful  programs  were  designed  to  play  a  role 
analogous  to  that  of  a  human  consultant.  Not  surprisingly,  this  work  was 
done  in  areas  of  Medicine.  Chemistry,  and  Geology  in  which  there  is  an 
established  tradition  for  consultation.  Human  consultants  in  these  fields  are 
valuable  because  they  are  specialists  who  possess  extensive  knowledge 
about  particular  problem  domains. 

Expert  consultation  systems  have  focused  on  problems  in  which  a  human 
expert  s  knowledge  is  largly  factual  in  nature.  Here,  the  key  to  solving  a 
problem  lies  more  in  knowing  the  relevant  information  than  in  ingeniously 
constructing  a  solution  from  logical  principles.  The  human  expert  is 
distinguished  by  knowing  ail  of  the  factors  that  are  important,  and  by 
processing  judgment  in  combining  diverse  considerations  to  reach  a  decision. 
It  follows  that  a  corresponding  expert  system  must  have  effective  ways  to 
represent  and  employ  large  amounts  of  different  kinds  of  knowledge  bearing 
on  specialized  problems.  Generaiy,  the  expertise  is  in  the  form  of  rules,  and 
these  rules  form  the  knowledge  base  of  the  system.  Recently,  expert 
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systems  have  been  found  effective  in  planning,  monitoring,  and 
interpretation  tasks  [Ref.  511.  Researches  have  applied  this  technology  to  a 
variety  of  military  problems.  For  example,  planning  of  aircraft  missions  [Ref. 
52).  simulation  of  air  battles  (Ref.  331.  and  analysis  of  platforms  operating  in 
a  certain  area,  and  their  location  and  the  activity  in  which  they  are  engaged. 
The  symbolic  nature  of  the  information  fusion  problem  and  absence  of  a  well 
developed  approach  to  it,  suggested  that  a  system  with  rapid  prototyping 
capabilities  would  be  helpful  (Ref.  301.  Some  expert  system  shells  provide 
tools  which  make  prototyping  of  data  structure  quick  and  the  rules 
employed  for  reasoning  easier  to  implement  and  modify  than  conventional 
programs.  Figure  7.1.  shows  the  general  structure  of  an  expert  system. 

B.  KNOWLEDGE  REPRESENTATION  USING  RULES 

Rules  provide  a  formal  way  of  representing  recommendations,  directives, 
or  strategies.  They  are  often  appropriate  when  the  domain  knowledge 
results  from  empirical  associations  developed  through  years  of  experience 
solving  problems  in  an  area.  Rules  are  expressed  as  IF  ■  THEN  statements, 
as  shown  below. 

IP  :  A  is  true 

and  fi  is  true 

and  Cis  false 

THEN :  conclude  X 

This  is  a  type  of  production  rule  which  has  the  general  form  [Ref.  -131: 

IF:  logical  conditions  are  satisfied 

THEN:  take  the  indicated  action 
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When  the  IF  portion  of  a  rule  is  satisfied  by  tbe  facts,  tbe  action  specified 
by  tbe  THEN  portion  is  performed.  Among  tbe  potentially  important  assets 
of  tbe  production  rules  approach  is  that  it  provides  tbe  means  of 
understanding  bow  a  decision  was  reached  and  is  able  to  explain  and  correct 
erroneous  conclusions. 

A  controlling  framework  is  used  to  allow  tbe  user  to  access  tbe 
knowledge  base  in  tbe  manner  of  a  consultation  whereby  tbe  user  may 
volunteer  information  or  tbe  machine  may  question  tbe  user  until  sufficient 
evidence  is  g*:h?red  to  produce  useful  conclusions.  The  user  may  also  ask 
tbe  system  to  explain  its  reasoning  so  that  be  may  understand  its  reasoning 
and  undestand  bow  tbe  conclusions  were  reached. 

This  method  of  problem  solving  was  adopted  as  it  seemed  to  fit  the 
muitisensor  data  fusion  problem,  assuming  that  tbe  human  expertise  exists. 
However,  most  of  tbe  well-publicised  expert  systems  are  in  quite  different 
problem  domains  to  multisensor  information  fusion.  Examples  being  medical 
diagnosis,  fault  diagnosis  and  tbe  well-  known  mineral  prospecting  expert 
system  called  PROSPECTOR.  In  this  type  of  problem,  it  can  be  assumed  that 
all  symptoms  belong  to  tbe  same  patient,  whereas,  in  multisensor 
information  fusion,  there  is  tbe  problem  of  finding  out  which  evidence 
belongs  to  which  patients,  and  indeed  bow  many  patients  are  present.  Also 
tbe  muitisensor  information  fusion  problem  is  a  continuous,  real-time 
problem,  rather  than  a  single  shot  diagnosis. 

Tbe  heart  of  an  expert  system  is  its  corpus  of  knowledge  (Ref.  54).  When 
A1  scientists  use  the  term  "knowledge",  they  mean  tbe  information  a 
computer  program  needs  before  it  can  behave  intelligently.  The  knowledge 
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is  an  expert  system  is  organized  in  a  way  that  separates  the  Knowledge 
about  the  problem  domain  from  the  system's  other  Knowledge,  such  as 
general  Knowledge  about  how  to  solve  problems  or  Knowledge  about  how  to 
interact  with  the  user.  Several  advantages  accrue  to  this  separation: 

a.  The  same  Knowledge  can  be  used  for  more  than  one  purpose.  For 
example,  a  given  Knowledge  base  can  be  used  to  solve  a  particular 
problem,  to  provide  an  explanation  for  the  solution,  or  to  support 
computer-aided  instruction  about  the  problem. 

b.  The  power  of  the  program  can  be  extended  either  by  expanding  the 
Knowledge  base  or  by  adding  facilities  to  the  interpreter.  In  particular, 
this  ai’ows  a  large  system  to  be  developed  incrementally. 

c.  The  problem-solving  mechanisms  and  system  facilities  of  the  interpreter 
can  be  applied  to  similar  problem  domains  by  replacing  the  old 
Knowledge  base  by  a  Knowledge  base  for  the  new  domain. 


Figure  7.1.  General  Structure  of 
an  Expert  System 
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The  collection  of  domain  knowledge  is  called  the"  knowledge  base .  while 
the  general  problem-  solving  knowledge  is  called  the  "inference  engine".  A 
software  with  knowledge  organized  this  way  is  called  a  "knowledge-based 
system  .  Virtually,  all  expert  systems  are  knowledge-based  systems,  while 
the  converse  is  not  necessarily  true  (Ref.  541. 


EXPERT  SYSTEM 


Figure  7.2.  Structure  of  an  Expert  System 

As  shown  in  Figure  7.2,  the  knowledge  base  in  an  expert  system  contains 
facts  (data)  and  rules  that  use  those  facts  as  the  basis  for  decision  making. 
The  inference  engine  contains  an  interpreter  that  decides  how  to  apply  the 
rules  to  infer  new  knowledge  and  a  scheduler  that  decides  the  order  in 
which  the  rules  should  be  applied.  When  the  IF  portion  of  a  rule  is  satisfied 
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by  the  facts,  the  action  specified  by  the  THHN  portion  is  performed.  When 
this  happens  the  rule  is  said  to  fire  or  execute  [Ref.  551.  A  rule  interpreter 
compares  the  IF  portions  of  rules  with  the  facts  and  executes  the  rule  whose 
IF  portion  matches  the  facts  as  shown  in  Figure  7.3. 


Figure  7.3.  The  Rule  Interprets  Cycles  Through 
a  Match-Execute  Sequence 

C.  INVOKING  RULES  IN  A  RULE-BASED  SYSTEM 

There  are  two  important  ways  in  which  rules  can  be  used  and  invoked  in 
a  rule-based  system;  one  is  called  backward  chaining  and  the  other  forward 
chaining. 

l.  Backward  Chaining 

Backward  chaining  is  often  described  in  terms  of  goal-directed 
reasoning  or  top-down  reasoning.  In  backward  chaining  the  system  has  a 
set  of  initial  goals,  and  the  rules  are  invoked  in  reverse  order.  The  system 
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begins  by  examining  a  limited  set  of  production  rules,  whose  right-hand 
sides  are  the  goals.  The  system  then  proceeds  to  examine  the  left-hand  side 
of  the  rules  to  see  which  of  the  goals  (RHS)  are  satisfied.  As  the  rules  are 
examined  in  this  backward  unraveling,  some  premises  (of  the  left-hand  side 
of  rules)  are  unknown  (logically  unsatisfied)  and  therefore  they  become  new 
subgoals.  If  a  subgoal  is  unknown,  a  question  may  be  asked  to  determine  its 
status.  Its  strategy  can  be  summarized  in  the  following  steps: 

( 1 )  Find  a  rule  a  THEN"  pattern  that  matches  the  goal 

Found - Go  to  step  2. 

Not  Found  —  Fail 

(2)  Use  the  "IF'  part  of  the  rule  to  establish  new  sub-goal(s). 

(3)  Find  fact(s)  that  satisfies  the  new  sub-goal(s). 

2.  Forward  Chaining 

In  forward  chaining  the  system  does  not  start  with  any  particular  goals 
for  it  That  is,  it  has  no  initial  subgroup  of  production  rules  which  establish  a 
starting  point  Instead,  the  system  starts  with  a  subset  of  evidence  and 
proceeds  to  invoke  the  production  rules  in  a  forward  direction,  continuing 
until  no  further  production  rules  can  be  invoked.  Its  strategy  can  be 
summarized  in  the  following  steps: 

( 1 )  rind  a  rule  with  an  "IF  pattern  that  matches  a  fact 

Found . Go  to  step  2. 

Not  Found - Fail  to  find  a  goal. 

(2)  Assert  the  rules  THEN"  clause,  i.e  add  a  new  fact  to  the  data  base. 

(3)  Does  the  new  fact  satisfy  the  goal? 

Yes . we  are  successful,  quit 

No . Go  to  step  1 . 
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3.  Badoacfl  Versus  fonrird  Chaining 

Although  several  systems  have  been  built  emphasizing  backward 
chaining,  both  forms  of  invocation  and  evaluation  of  the  production  rules  are 
equally  valid  as  long  as  they  yield  the  same  correct  conclusions.  The  rate  of 
arriving  at  the  conclusions  will  probably  difTer  considerably  depending  on 
the  strategy  adopted.  Most  classification  problems  can  be  solved  using 
either  one  of  the  approaches  individually  or  a  mixture  for  production  rule 
evaluation.  The  shape  of  the  problem  space  determines  which  is  better.  As 
shown  in  Figure  7.4.  fan-in  calls  for  forward  chaining  and  fan-out  calls  for 
backward  chaining.  The  bidirectional  search  is  often  favorable.  Using  it  the 
forward  chaining  begins  from  the  known  facts  and  the  backward  chaining 
begins  from  the  best  hypothesis. 
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D.  METAKNOWLEDGE  AND  EXPLANATION  FACILITY 

An  expert  system  has  knowledge  that  lets  it  reason  about  its  own 
operations  plus  a  structure  that  simplifies  this  reasoning  process.  This 
knowledge  the  system  has  about  how  it  reasons  is  called  "metaknowledge", 
which  just  means  knowledge  about  knowledge.  Also,  it  is  better  to  have  what 
is  Called  an  "explanation  facility".  This  is  knowledge  for  explaining  how  the 
system  arrived  at  its  answer. 

E.  BLACKBOARDS 

Blackboards  refers  to  a  particular  Ai  problem  solving  methodology.  The 
best  known  applications  of  the  blackboard  methodology  are  HEARSAY-11,  a 
speech  understanding  system  (Ref.  561.  and  the  HASP/SIAP  sonar  data 
interpretation  system  (Ref.  57. 5$l.These  applications  effectively  processed 
regular  streams  of  data  from  a  single  sensor,  treating  any  other  information 
as  locally  static.  But  the  blackboard  methodology  is  more  generally 
applicable.  In  particular,  it  provides  a  convenient  framework  for  integrating 
maximally  reduced  information  from  multiple  sources  with  different 
temporal  characteristics. 

The  blackboard  problem  solving  methodology  originated  approximately 
1 0  years  ago  and  has  been  evolving  ever  since  (Ref.  591.  The  main  feature 
of  a  blackboard  system  is  a  global  data  store  holding  input  data  and 
hypotheses  about  the  solution  of  the  problem  derived  from  that  data. 

Related  information  is  kept  together.  The  global  data  store  is  known  as  the 
blackboard. 
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F.  REPRESENTING  UNCERTAINTY  IN  EXPERT  SYSTEMS 

Expert  systems  are  often  forced  to  make  judgements  in  the  light  of 
incomplete  or  unreliable  data.  The  general  problem  of  drawing  inferences 
from  uncertain  or  incomplete  data  has  inspired  a  variety  of  technical 
approaches.  Parade  (Ref.  60]  offers  a  review  of  different  approximate 
reasoning  techniques  which  have  been  proposed  for  dealing  with  uncertain 
or  imprecise  knowledge  in  expert  systems.  These  tecniques  can  be 
summarized  as  Bayesian  model,  Dempster 'Shafer  belief  theory,  fuzzy  logics, 
and  ad  hoc  approaches. 

1.  Bavesian  Model 

One  of  the  most  useful  results  of  probability  theory  is  Bayes  theorem, 
which  provides  a  way  of  computing  the  probability  of  a  particular  event 
given  some  set  of  observation  which  is  made.  Let 
P(H]IE)  -  the  probability  that  hypothesis  H|  is  true  given  evidence  E 

P(E!H|)  -  the  probability  that  the  evidence  E  is  observed  given  that 
hypothesis  i  is  true 

P(Hj)  -  the  a  priori  probability  that  hypothesis  i  is  true  in  the  absence  of 
any  specific  evidence.  These  probabilities  are  called  prior 
probabilities  or  priors. 

k  -  the  number  of  possible  hypotheses 
The  theorem  then  states  that 

k 

P(H|IE)  -  WBH,) '  P<H|)  /  *  P(H,)  (7. ! ) 

nz\ 
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For  a  loos  time,  the  Bayesian  model  had  been  the  only  numerical 
approach  to  inference  with  uncertainty,  since  no  quantification  was 
introduced  in  the  patterns  of  plausible  reasoning.  One  of  the  best-developed 
uses  of  Bayes  theorem  for  AI  problems  is  in  the  solution  of  pattern 
recognition  or  classification  problems.  Bayes  theorem  can  be  modified  to 
handle  a  variety  of  more  complicated  situations.  But  there  are  several 
drawbacks  to  the  use  of  Bayes  theorem.  It  is  often  difficult  to  collect  all  the 
a  priori  conditional  and  joint  probabilities  required.  Doing  so  would  require 
accumulating  a  great  mass  of  data.  Doing  so  would  also  be  very  expensive. 
But  worse,  the  data  would  be  obsolete  by  the  time  they  were  collected.  It  is 
very  difficult  to  modify  the  database  of  a  Bayesian  system  because  of  the 
large  number  of  interactions  between  the  various  components  of  it.  Also, 
evaluating  Bayes  formula  to  give  an  accurate  estimate  of  the  probability  of  a 
particular  outcome  must  be  disjoint.  It  cannot  ever  happen  that  two  of  them 
occur  at  once.  This  is  often  not  the  case.  The  accuracy  of  Bayes'  formula  also 
depends  on  the  availability  of  a  complete  set  of  hypotheses.  In  other  words, 
it  must  always  be  the  case  that  one  of  the  known  hypotheses  is  true.  For  all 
of  these  reasons.  Bayes'  theorem  does  not  appear  to  solve  problems  that 
arise  in  uncertain  reasoning  in  real-world  problems,  although  it  does  serve 
as  the  basis  for  some  probabilistic  AI  systems.  e.g.,  PROSPECTOR  [Ref.  61]. 


Several  mathematical  models  of  uncertainty,  which  depart  from  the 
usual  probability  approach,  have  been  recently  proposed,  particulary, 
Dempster-  Shafer  belief  theory  iFef.  62).  It  is  also  called  mathematical 
theory  of  evidence.  A  scheme  for  combining  evidence  which  includes 
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uncertainty  or  ignorance  was  devised  by  Dempster  and  later  formulated 
within  a  flexible  representation  framework  by  Shafer.  It  is  more  general 
than  either  a  Boolean  or  Bayesian  approach,  providing  a  formal  method  for 
integrating  knowledge  derived  from  a  variety  of  sources  for  use  in 
perceptual  reasoning.  In  this  formalism,  the  likelihood  of  a  proposition  Aj  is 
represented  as  a  subinterval.  (s(A{).  p(A{)].  of  the  unit  interval.  (0.  1).  The 
evidential  support  for  proposition  A}  is  represented  by  s(A{).  while  p(A{) 
represents  its  degree  of  plausibility,  p(A{)  can  also  be  interpreted  as  the 
degree  to  which  one  fails  to  doubt  Aj.  p(  Aj)  being  equal  to  one  minus  the 
evidential  support  of  ~Aj  (the  symbol is  the  Boolean  NOT).  Le.,  the 
plausibility  is  the  complement  of  the  support  for  ~A{.  So.  p(Aj)  is 

p(Aj)  -  1  -  s(~Aj) 

The  lower  value.  s(A{).  represents  the  support  for  that  proposition  sets 
a  minimum  value  for  its  likelihood.  The  upper  value,  p(Aj),  denotes  the 
plausibility  of  that  proposition  and  establishes  its  maximum  likelihood. 
Support  may  be  interpreted  as  the  total  positive  effect  a  body  of  evidence 
has  on  a  proposition,  while  plausibility  represents  the  total  extent  to  which  a 
body  of  evidence  fails  to  refute  a  proposition.  The  degree  of  uncertainty 
about  the  actual  probability  value  for  a  proposition  correspond  to  the  width 
of  its  interval.  So.  the  uncertainty  of  Af*  is: 

u(A|)  -  p(Aj)  -  s(Aj) 
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Dempster's  rule  of  combination  requires  tint  the  knowledge  sources  be 
independent.  The  representation  involves  the  assignment  by  a  knowledge 
source  of  "probability  masses  ”.  The  mass  allocated  by  a  certain  knowledge 
source  to  A|  is  denoted  m(A]).  To  clarify,  assume  that  there  is  a  set  of  n 

mutually  exclusive  and  exhaustive  propositions,  such  as  that  the  target  is  of 

type  Aj,A2 . A„.  The  method  of  evidential  reasoning  can  assign  a 

probability  mass  denoted  as  m(Aj)  to  any  of  the  original  n  propositions  or  to 

disjunctions  of  the  propositions.  For  example,  a  disjunction  is  the  proposition 
that  the  target  is  of  type  A(  or  A2  (denoted  as  A|  VA2)  and  the  mass 
assignment  is  denoted  as  m(A|  VA2).  There  are  (2n-l )  such  general 
propositions  (including  all  the  possible  disjunctions)  that  may  be  assigned 
mass,  and  the  masses  summed  over  ail  of  these  propositions  must  equal 
unity.  It  is  noticed  that  this  is  a  more  general  form  of  representation  differs 
from  the  standard  Bayesian  approach  in  which  probabilities  are  assigned 
only  to  the  original  n  propositions,  disjunctions  are  not  considered.  The 
representation  to  uncertainty  6  is  mass  assignment  to  the  disjunction  of  all 
the  original  propositions  and  is  denoted  by 

m(e)-m(A,VA2V . VA#)  (7.2) 

Finally,  mass  can  be  assigned  to  the  negation  of  a  proposition.  For 
example,  the  mass  assigned  to  the  negation  of  a  A  ((the  target  is  not  type  A|) 

is  denoted 

m(~Ai)  -  m(A2VA3V . VA„)  (7.3) 

The  support  s(A|)  for  the  basic  proposition  that  the  target  type  is  A  |  is  just 
the  mass  associated  with  Aj(s(A|)  -  m(Aj)).  For  a  more  complex  proposition 
such  as  that  the  target  is  either  type  A;,  A2  or  A3,  is  expressed  as 
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s(A| VAjVAj)  *  m(A|)  ♦  m(A2)  *  fli(Aj)  ♦  m(AjVA2)  ♦  £Q( A |  VAj) 

-  tn(A2VA3)  ♦  (A,  VA2VA3)  (7.-4) 

The  plausibility  of  a  given  proposition  as  mentioned  before,  is  the  sum  of  all 
masses  not  assigned  to  its  negation.  Alternatively,  p(A{)  can  be  computed  by 
summing  all  masses  associated  with  Aj  and  all  disjunctions,  including  6,  that 
contain  A{.  For  eiampie, 

p(A|)  -  m(Aj)  m(A|  VA2)  ♦ . -  m(9)  (7.5) 

The  use  of  these  Shafer 'Dempster  techniques  as  they  are  known 
appears  more  complex  than  the  use  of  the  simple  Bayesian  process  with  its 
single  set  of  probabilities  and  one  of  the  difficulties  in  pursuing  such  an 
approach  is  to  determine  whether  the  extra  complexity  is  justified  by  the 
results  which  can  be  expected.  It  is  important  to  remember  that  an  operator 
may  have  to  make  decisions  based  on  the  outcome  of  the  identity  process 
(Ref.  631.  We  can  easily  see  that  an  output  such  as,  for  example, "  the 
probability  that  the  detected  aircraft  is  an  enemy  is  at  least  30%  and  could 
be  70%  "  seems  more  likely  confuse  than  to  clarify. 

3.  Fuzzy  Logic 

The  objective  of  fuzzy  logic  is  to  modify  (or  "fuzzify")  logic  so  that  it 
applies  directly  to  informal  arguments.  Fuzzy  logic  results  from  two  stages 
of  "fuzzification 

a.  The  introduction  of  vague  predicates  into  the  object  language.  This 
result  in  some  form  of  multivalued  logic. 

b.  Treating  the  metalinguistic  predicates  true '  and  "false"  as  themselves 
vague  or  fuzzy. 

The  second  stage  is  by  far  the  most  radical  and  controversial.  Fuzzy  logics 
have  been  imported  into  AI  to  deal  with  areas  of  vagueness  and  incomplete 
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information.  Most  expert  systems,  for  example,  are  forced  to  taice  decisions 
when  not  all  the  facts  pertaining  to  the  decision  are  available.  In  such 
contexts  it  is  natural  to  employ  logics  which,  unlike  classical  logic,  are  suited 
to  reasoning  with  such  incomplete  information.  Non -monotonic  logic  has  also 
been  developed,  largiy  by  the  A I  fraternity  itself,  to  deal  with  reasoning 
with  incomplete  information.  Moreover,  many  concepts  employed  in  natural 
language  and  AI  are  claimed  to  be  "vague",  and  the  necessity  of  reasoning 
with  such  concepts  suggests  that  some  logic  of  vagueness  is  appropriate 
(Ref.  64].  For  example,  the  concept  "young",  it  may  be  said  that  people 
under  1 0  years  of  age  are  young  and  those  above  60  years  are  not  young. 
However,  there  is  no  particular  day  at  which  a  person  s  age  switches  from 
young  to  not  young '.  rather,  this  is  a  gradual  transition.  In  fuzzy  logic,  the 
concept  of  young  is  expressed  by  a  "membership  function"  representing  the 
degree  to  which  a  person  of  a  particular  age  can  be  considered  to  be  young. 

It  should  be  said  that  many  applications  of  fuzzy  logic  are  both 
philosophically  and  practically  controversial,  and  the  whole  area  is.  at 
present,  controversial. 

Zadeh  offers  two  main  reasons  for  adopting  fuzzy  logic  iRef.  65,66). 
First,  he  claims  that  it  avoids  the  complexities  introduced  by  regimentation 
of  informal  argument;  secondly,  he  claims  that  it  is  the  proper  way  to 
acknowledge  that  true'  and  false'  are  not  precise  but  fuzzy. 

In  Fuzzy  Logic  (FL)  the  set  of  truth-values  of  the  base  logic,  the  set  of 
points  in  the  interval  (0.1).  is  replaced  by  fuzzy  subsets  of  that  set.  Zadeh 
does  not.  however,  allow  all  fuzzy  subsets,  This,  it  is  claimed,  would  result  in 
unmanageable  complexity'.  Instead,  Zadeh  employs  only  a  countable  and 
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structured  set  of  fuzzy  subsets  of  (0,1]  referred  to  as  "linguistic  truth' 
values".  More  explicitly,  the  Truth-Values  (TV)  set  of  FL  is  assumed  to  be  a 
countable  set,  TV,  of  the  form 

TV  -  (true/aise,  not  true,  very  true,  not  very  true, 
more  or  less  true,  rather  true,  not  very  true. 

not  very  false . )  (7.6) 

Each  element  of  this  set  represents  a  fuzzy  subset  of  (0,1  ].  Moreover,  each 
element  of  TV  is  generated  from  the  fuzzy  set  denoted  by  the  term  true . 

So.  for  example,  if  Utwis  the  membership  function  of  the  fuzzy  subset  true. 

then  the  membership  functions  for  the  other  members  of  TV  might  be  given 
as  follows: 


iWv) 

*  iVwd-  u) 

(7.7) 

•  1  *  Uin.i  If) 

(7.8) 

**) 

-  <iWi»))J 

(7.9) 

Urathcrtruaf  V) 

-  «W»))"2 

(7.10) 

etc.,  where  v  is  the  fuzzy  variable.  So  that  once  the  meaning  of  true',  and 
the  rules  of  computation  are  fixed,  then  so  is  the  meaning  of  all  the  members 
of  TV  As  a  consequence,  the  meaning  of  the  linguistic  truth-values  (that  is, 
the  fuzzy  subsets  they  denote)  is  crucially  dependent  upon  the  meaning 
chosen  for  true'.  Moreover,  it  is  quite  difficult  to  see  such  a  choice  as 
anything  other  than  arbitrary.  Zadeh  hints  that  the  choice  is  motivated  by 
the  specific  area  of  discourse  under  consideration.  Consequently,  the 
meanings  assigned  to  the  linguistic  truth-values  are  localised. 
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How  are  the  logical  constants  ~  ,  & ,  V  and  ->  is  used  to  obtain  their 

meanings  in  a  regime  where  truth  values  are  elements  of  TV?  As  a  first 
step  we  might  proceed  as  follows: 


(-A!  (») 

1  (fAl(t>» 

(7.11) 

!A  &  Bl  (») 

•  1A]  (V)  A  IB]  (V) 

(7.12) 

(A  VBI(v) 

•  |A)(V)  V(BKv) 

(7.13) 

IA  -» Bl(v) 

-  [Al  (v)  ->  IBI  (v) 

(7.14) 

where  the  connectives  ~  ,  & ,  V  ,  1 ,  A  and  are  those  of  the  base  logic 

and  each  (A)  denotes  a  fuzzy  subset  of  (0.1]  represented  above  by  its 
membership  function.  But  there  is  a  problem  with  this  way  of  proceeding. 
We  want  each  sentence  in  the  language  to  denote  not  just  an  arbitrary  fuzzy 
subset  of  (0.1  ]  but  rather  an  element  of  TV.  Unfortunately,  the  above 
semantics  offers  no  guarantee  of  this.  Zadeh  circumvents  this  difficulty  by 
introducing  the  notion  of  a  linguistic  Approximation"  (LA).  Bach  fuzzy 
subset  A  of  (0.1]  has  associated  with  it  an  element  A*  of  TV.  it  is  called 
Linguistic  Approximation  (LA).  This  is  expressed  as 

A'-LA(A).  (7.15) 

Unfortunately,  there  is  not  an  obvious  candidate  for  the  notion  of  best'  of 
such  approximation,  nor  a  general  technique  for  computing  good1  ones.  But 
whatever  the  merits  of  this  notion.  Zadeh  employs  it  to  provide  the  meanings 
of  the  logical  constants  as  follows: 


I*A1(») 

•  LAC  \v.  "1  ((A]  (U») 

(7.16) 

(A  <t  Bl  (V) 

-  LA(  \W.((A]  (U)  A(B](y))) 

(7.17) 

IA  VBI(V) 

-  LA(  \W.((A|  (tr)  V(B)  (xe))> 

(7.  IB) 

IA  ->  Bl(v) 

-  LAC  \P.((A]  (v)  (Bl(v))) 

(7.19) 
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Now.  the  functions  [Al  and  (B)  associate  with  each  sentence  an  element  of  TV. 
the  set  of  fuzzy  truth-values. 

The  introduction  of  fuzzy  truth-values  paves  the  way  for  a  rather 
radical  approach  to  inference.  According  to  Zadeh.  inference  is  only 
approximate'.  Zadeh  illustrates  his  notion  of  approximate  reasoning  by 
reference  to  examples  of  the  form 
a  is  small 

a  and  b  are  approximately  equal 

b  is  more  or  less  small 
To  illustrate,  consider  the  statement 
a  is  small 

Under  the  administration  of  classical  logic  this  proposition  would  be 
rendered  true  just  in  case  a  belongs  to  the  set  which  constitutes  the 
extention  of  the  predicate  small  In  fuzzy  logic,  however,  things  are 
somewhat  more  involved.  The  predicate  small  is  fuzzy,  and  proposition  "a  is 
small”  is  interpreted  as  the  assignment  of  a  fuzzy  predicate  as  the  value  of  a 
variable  which  corresponds  to  an  implied  attribute  of  a.  More  explicitly,  this 
proposition  would  be  interpreted  as  the  assignment  equation 
Heighda)  -  small 

where  "Height”  is  the  implied  attribute.  In  equational  terms  the  second 
premise  of  our  example  would  be  rendered  as 

(HeightU),  Height(b))  -  approximately  equal 
where  the  right-hand  side  represents  a  fuzzy  subset  of  (0,1 )  x  [0, 1]. 

In  general,  then,  a  proposition  of  the  form 
(a, . an)  isC 

is  rendered  as  the  assignment  equation 
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R(l||W«fm.|  Iq)  *  C 

where  R  is  the  implied  attribute.  For  simplicity  Zadeh  writes  this  as 
(a,  •»)  “  C 

The  premises  of  our  example  thus  constitute  a  pair  of  assignment  equations 
of  the  form 

a  -  small 

(a,  b)  •  approximately  equal 

and.  in  general,  a  collection  of  propositions  (al| . ai0)  is  G.  0  $  i  s  n- 1 

yield  a  set  of  equations 

(aii . aift)  -  Ci  0*i2n-l  (7.20) 

For  Zadeh.  approximate  inference  amounts  to  solving  such  systems  of 
equations.  As  with  equations  in  ordinary  algebra,  we  can  solve  for  any  of 
the  variables  involved  in  the  equations.  As  an  illustration,  solving  for  b  in 
our  example  yields: 

b  -  LAlsmall 0  approximately  equal), 
where  0  is  the  composition  of  fuzzy  relations,  and  is  given  by 

Uuull  *asprexiMi«ly«etMl(b)  • 

V  A  equaj(X.  b)l  (7.21) 

x 

where  V  represents  the  supremum  over  ail  objects  in  the  domain  of  the 
x 

fuzzy  predicate  'small". 

Intuitively,  the  composition  of  the  predicate  small"  and  the  binary 
relation  "approximately  equal"  represents  the  fuzzy  predicate  which  returns 
that  value  which  represents  the  best  fit,  between  those  objects  which  are  in 
the  domain  of  small,  and  which  are  approximately  the  same  height  as  b. 
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According  to  Zadeh,  the  consequence  of  a  given  set  of  premises  depends  in 
an  essential  way  on  the  meaning  attached  to  the  fuzzy  sets  which  appear  in 
the  premises.  This  is.  apparently,  a  consequence  of  the  local  character  of 
fuzzy  TV’s.  Consequently,  validity  can  only  be  characterised  semantically, 
and  the  traditional  notions  of  completeness  and  consistency  are  peripheral  to 
fuzzy  logic. 

In  the  light  of'  the  features  of  fuzzy  logic,  it  would  seem  that  it  lacks 
the  precise  formal  rules  of  inference  in  addition  to  the  absence  and  apparent 
irrelevance  of  consistency  and  completeness  results  and  the  employment  of 
a  philosophically  suspect  theory  of  truth.  All  these,  engender  a  feeling  of 
insecurity.  Indeed,  as  Haak  points  out  (Ref.  67).  fuzzy  logic  seems  hardly 
recognisable  as  a  logic  at  all.  Probably,  the  best  defence  of  fuzzy  logic  is 
located  not  in  its  conceptual  foundations  but  in  its  potential  applications 
(Ref.  68).  After  all.  many  formal  frameworks  have  been  employed  with 
much  success  even  though  their  conceptual  foundations  have  been  in  a  sorry 
state. 

4.  Ad  hoc  Approaches 

Many  researchers  in  artificial  intelligence  have  felt  a  need  for 
alternatives  of  the  standard  Bayesian  approach  and  hove  proposed  and  used, 
generally  with  success,  more  empirical  models,  particularly  in  eipert 
systems  such  as  MYCIN  (Ref.  69J,  and  others  (Ref,  70,71).  Also  many  expert 
systems  employ  some  form  of  numerical  assignments  to  assertions  which  are 
often  combined  in  ways  which  suggest  that  such  assignments  behave 
mathematically  like  probabilities. 


136 


One  of  the  earliest  approaches  to  reasoning  with  uncertainty  was 
incorporated  into  the  MYCIN  system  (Ref.  69).  it  introduced  a  notion  of " 
approximate  implication"  using  numbers  called  "  certainty  factors'*  which 
were  used  to  indicate  the  strength  of  a  heuristic  rule.  For  example.  MYClN's 
knowledge  base  includes  the  rule: 

IF  The  infection  is  primary-bacteremia  and  the  site  of  the  culture  is 
one  of  the  sterile  sites  and  the  suspected  portal  of  entry  of  the 
organism  is  the  gas tro- intestinal  tract 
THEN  There  is  suggestive  evidence  (.7)  that  the  identity  of  the  organism 
is  bacteroides. 

The  number  .7  is  the  certainty  factor  (in  the  range  0  to  1 )  of  the 
conclusion.  In  MYCIN.  assertions  are  not  just  true  or  false,  the  reasoning  is 
vague  or  inexact  and  is  indicated  on  a  numerical  scale.  MYClN's  conjunction 
operator  performs  a  minimisation,  and  its  disjunction  is  furnished  with  a 
Bayesian  interpretation.  To  elaborate,  all  assertions  being  considered  by 
MYCIN  have  associated  with  them  two  numbers,  a  Measure  of  Belief  (MB) 
and  a  Measure  of  Disbelief  (MD).  The  MB  of  a  hypothesis  h  given  evidence  e 
is  the  proportionate  decrease  in  disbelief  in  h.  and  can  be  thought  of  in  terms 
of  probabilities  as 

1  if  Pf  h)  •  1 

MB(h.e)  -((aulP<hl#)J><h)) -  P(li  Maul  1.0} -  P(H))  othereiu  (7.22) 

Similarly,  the  MD  is  the  proportionate  decrease  in  belief  in  h  as  a  result  of  e 
1  ifP(h)-0 

MDlh.e]  -((ain(P(hl«)J>(h))-P(h))/(ffiifl(1.0)-P(H))  otherriu  (7.23) 
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A  particular  piece  of  evidence  either  increases  the  probability  of  h,  in  which 
case  MB(h,e)  >  0  and  MD(h,e)  -  0  (i.e.,  there  is  no  reason  to  disbelieve  h),  or  it 
decreases  the  probability  of  h.  in  which  case  MD(h.e)  >  0  and  MfTh.e)  -  0. 

This  relationship  can  be  seen  from  the  above  formulas  for  MB  ar  d  MD. 

From  these  two  measures,  an  overall  estimate  of  the  confidence  of  the 
system  in  its  belief  about  the  hypothesis  can  be  computed.  This  estimate  is 
called  the  Certainty  Factor  (CF)  and  is  given  as 

CFfh.el  -  MBIh.el  -  MDlh.el  (7.24) 

It  is  noticed  that  if  CF  is  positive,  the  system  believes  that  the  hypothesis  is 
true;  if  CF  is  negative,  there  is  more  evidence  against  it  and  the  system 
believes  it  to  be  false.  By  separating  this  measure  into  the  two  components 
MB  and  MD.  the  problem  of  slight  confirmatory  evidence  being  interpreted 
as  disconfir mation  is  avoided.  Considering  several  pieces  of  evidence,  the 
measures  of  belief  and  disbelief  of  a  hypothesis  given  two  observations  s( 
and  s2  are  computed  by; 

0  iflfBU^&^l-l 

MB(h.S|&32)  ♦  MB(h.s2]*<l-MB(h,S|))  othenriw  (7.25) 

0  if  •  1 

MDfh,Si&s2] -{mDIPji)  ♦  MJWh^l’tl-MlMh.si))  otherwise  (7.26) 

One  way  to  state  these  formulas  in  English  is  that  the  measure  of  belief 
in  h  is  0  if  h  is  disbelieved  with  certainty.  Otherwise,  the  measure  of  belief 
in  h  given  two  observations  is  the  measure  of  belief  given  only  one 
observation  plus  some  increment  for  the  second  observation.  This  increment 
is  computed  by  first  taking  the  difference  between  1  (certainty)  and  the 
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belief  given  only  the  first  observation.  This  difference  is  the  most  that  can 
be  added  by  the  second  observation.  The  difference  is  then  scaled  by  the 
belief  in  h  given  only  the  second  observation.  A  corresponding  explanation 
can  be  given,  then,  for  the  formula  for  computing  disbelief.  From  MB  and 
MD.  CF  can  be  computed.  These  formulas  meet  several  requirements  that 
one  might  wish  them  to  satisfy,  including  commutativity,  the  order  in  which 
a  set  of  observations  is  made  is  irrelevant. 

A  simple  example  will  show  how  these  functions  operate.  Suppose  that 
an  initial  observation  has  been  done  that  confirms  our  belief  in  h  with 
MB-0.3.  Then  MD(h.e]-0  and  CF(h,S|)-0.3.  Now  a  second  observation  has 
been  done,  which  also  confirms  h.  with  MB(h,s2)-0.2.  Now 
MBitUi&S}]  -  0.3  ♦  0.2*07  -  0.44 
MDfh,Si&s2l  -  0 
CFlh,3|&s2]  -0.44 

From  this  example  it  can  be  seen  how  slight  confirmatory  evidence  can 
accumulate  to  produce  increasingly  larger  certainty  factors. 

Sometimes  it  may  be  necessary  to  consider  the  certainty  factor  of  a 
combination  of  hypotheses.  It  can  be  computed  from  the  MB  and  MD  of  the 
combination.  The  formulas  MYCIN  uses  for  the  MB  of  the  conjunction  and 
the  disjunction  of  two  hypotheses  are 

MB(ht&h2,el  -  min(MB|hlfe),  MB(h2,e!)  (7.27) 

MB(h|  or  h2,e|  -  max(MB(h|,el,  MB(h2,ej)  (7.28) 

MD  c.in  be  computed  analogously. 

From  tactical  intelligence,  the  advantage  of  employing  deterministic 
values  instead  of  probabilities  is  often  called  for.  For  instance,  an  enemy  will 
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most  likely  make  a  maximum  g  terminal  maneuver  rather  than  the  average 
from  a  Monte  Carlo  simulation  of  all  possible  maneuvers.  This  tactical 
doctrine  may  prescribe  a  precise  maneuver.  Also,  for  very  good  reasons, 
people  often  feel  uncomfortable  estimating  prior  probabilities  (Ref.  721.  Yet, 
they  are  willing  to  say  whether  a  piece  of  evidence  increases  or  decreases 
the  probability  of  a  hypothesis  with  respect  to  its  prior  value,  and  are  often 
willing  to  use  the  certainty  value  to  estimate  the  amount  of  change.  Thus, 
certainties  are  particularly  useful  as  a  technique  for  talking  about  relative 
probabilities.  In  addition,  they  seem  more  natural  than  probabilities  when 
establishing  the  context  for  a  hypothesis. 

By  contrast  to  probability  theory,  these  ad  hoc  approaches  provide  the 
expert  with  a  language  for  more  directly  specifying  how  degrees  of  belief 
(expressed  as  subjective  probabilities)  are  to  be  computed.  The  language 
does  impose  some  constraints.  It  requires  that  functions  for  computing 
probabilities  be  composed  out  of  small  number  of  primitive  functions. 
However,  it  also  provides  considerable  freedom,  such  as  allowing  the 
specification  of  any  loop-free  network  topology  desired  to  group  factors  and 
control  the  flow  of  information.  The  price  paid  for  this  freedom  is  that  there 
is  no  longer  any  guarantee  that  all  of  the  axioms  of  probability  theory  will  be 
honored.  However,  if  one  views  the  values  computed  as  heuristic  measures 
of  degree  of  belief,  then  the  only  question  is  whether  or  not  it  is  easy  to 
construct  an  inference  network  that  adequately  approximates  the 
specifications  of  the  expert.  A  commonly  voiced  criticism  of  such  approaches 
is  that  they  are  unnecessarily  ad  hoc.  It  is  claimed  that  there  are  alternative 
approaches  available  which  are  better  documented  and  understood  (Ref.  72). 
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Mamdari  and  Efstathion  (Ref.  731  for  example,  claim  that  fuzzy  logic  itself 
would  provide  more  secure  foundation  for  the  enterprise.  As  a  matter  of 
fact,  PROSPECTOR  already  employs  some  form  of  fuzzy-sets  theory,  at  least 
according  to  the  recent  account  given  by  Gaschning  (Ref.  61). 

Generally,  it  is  hard  to  quantify  and  mathematically  relate  such 
subjective  parameters  as  partial  ECM.  intelligence  levels,  the  threat  and 
human  experience.  So,  it  is  adequate  and  easy  to  use  ad  hoc  approaches  to 
approximate  the  specifications  of  the  expert. 

G.  EXPERT  SYSTEM  DEVELOPMENT  TOOLS 

It  is  important  to  distinguish  between  expert  systems  and  expert  system 
development  tools.  As  shown  in  Figure  7.1,  expert  systems  are  specific 
applications  consisting  of  a  knowledge  domain  and  an  inference  engine.  A 
knowledge  domain  is  generally  the  human  expertize  on  a  particular  subject 
The  inference  engine  is  the  reasoning  software.  To  obtain  advice  from  an 
expert  system,  the  user  poses  a  problem  via  a  user  interface.  The  inference 
engine  accepts  the  request  reasons  about  the  query  and  the  knowledge 
stored  in  the  knowledge  domain,  and  responds  to  the  user.  Expert  systems 
can  be  used  to  provide  expert  advice  and  solve  probtems  using  a  given 
knowledge  domain.  When  a  user  presents  a  particular  problem  to  the  expert 
system,  it  uses  the  available  reasoning  knowledge  to  infer  some  advice, 
which  it  then  reports  to  the  user. 

An  expert  system  development  tool  can  take  the  form  of  an  Artificial 
Intelligence  (Al)  language  such  as  Lisp  or  Prolog,  an  expert  system  shell,  or 
an  integrated  artificial  intelligence  environment.  Shells,  a  higher  level  of 
development  tool,  facilitate  the  development  of  expert  systems  by  providing 
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a  generalised  inference  engine  and  the  ability  to  create  knowledge  domains 
in  any  subject  area.  An  integrated  expert  system  environment  provides  aii 
tbe  capabilities  of  a  sbeii  as  veil  as  other  tools,  such  as  decision  support 
software.  Expert  system  shells  or  environments  (shell  supersets)  that 
support  rule  representation  consist  of  a  rule  set  editor  (also  known  as  a  rule 
set  manager),  an  inference  engine,  and  generally,  a  user  interface.  Actually, 
the  user  interface  and  the  rule  editor  can  be  embeded  in  the  inference 
engine.  The  developer  of  an  expert  system  uses  a  rule  set  editor  to  write 
rules.  The  inference  engine  uses  the  contents  of  the  knowledge  domain  to 
arrive  at  its  solution  or  recommendation. 

H.  PAIRWISE  CORRELATION  FOR  TRACK  FUSION 

Track  fusion  is  a  difficult  process  and  numerous  algorithms  have  been 
defined  for  track  correlation  (Ref.  47, 63, 74, 75. 76, 77J.  Many  of  these  rely 
on  probabilities  to  combine  evidence,  while  others  make  hard  yes  or  no 
decision.  Because  of  the  diversity  of  sensors  that  are  operating  in  a  large 
area  surveillance  system,  the  state  vector  coordinate  system  where  the 
sensor/node  level  multitarget  tracking  problem  is  resolved  will  differ  for  the 
various  local-track  data  bases.  Assuming  that  the  required  coordinate 
transformation  at  each  node  is  done  using  the  appropiate  transformation 
algorithm,  the  information  collected  by  each  node  is  overlaid  in  a  common 
coordinate  system.  Furthermore,  the  time  points  at  which  the  target  states 
(latitude,  longitude,  etc.)  are  estimated  may  not  coincide.  Before  fusing  any 
two  tracks,  they  are  referred  to  a  common  time  instant  by  using  the 
prediction  equations  of  the  Kalman  filter  [Ref.  22]. 
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Following  Lakin  and  Miles  (Ref.  9).  we  adopted  pairwise  correlations  for 
track  fusion.  Figure  7.5,  shows  the  steps  used  in  pairwise  correlations  to 
solve  the  correlation  ambiguity,  which  involves  3  distinct  rule-driven  steps 

1.  First,  assume  all  tracks  (local  and  incoming)  available  at  each  node  are 
separate  and  each  track  implies  a  new  target. 

2.  Second,  apply  rules  which  create  the  possible  pairwise  correlations 
between  each  incoming  track  and  existing  local  tracks.  Those  fail  the 
pairwise  correlations  are  considered  new  tracks  for  targets  beyond  the 
coverage  of  local  sensor/sensors. 

3.  Thirc.  apply  rules  to  confirm  str'mg  correlations  and  to  deny  others. 
Where  alternatives  are  of  similar  strengths,  wait  for  further  evidence. 
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I.Stepl. 


Assume  all  tracks  available  at 
each  node  are  separate  and  each 
track  Implies  a  new  target. 


@0@ 


@@@ 


2.  Step  2. 

Apply  rules  which  create  the 
possible  pairwise  correlations 
between  each  incoming  track 
and  existing  local  tracks.  Those 
fail  the  pairwise  correlations 
are  considered  new  tracks. 


3.  Step  3. 

Apply  rules  to  confirm  strong 
correlations  and  to  deny  others. 
Where  alternatives  are  of  similar 
strengths,  wait  for  further  evidence 


T1 ,  T2,  T3  T  argets  T  racks 

fill  32, 33  Different  Assumed  Targets 

C  Pairwise  Correlation  Process 


FIGURE  7.5.  Pairwise  Correlation  for  Track  Fusion 
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The  correlation  process  is  basically  based  on  the  position  and  velocity 
closeness  of  each  two  tracks.  The  neit  logical  step  is  to  combine  the 
correlated  tracks  to  get  the  best  estimate.  A  straightforward  and  a  fairly 
obvious  approach  is  taking  a  weighted  average  with  most  weight  being  given 
to  the  most  accurate  sensor.  Another  approach  is  using  one  track  from  a 
correlated  pair  or  group  (one  which  is  beiived  the  most  accurate)  as  a 
representative  track  so  that  the  other  tracks  are  filtered  out.  One  of  the 
primary  reasons  for  using  this  approach  is  that  it  serves  the  immediate 
purpose.  If  the  accuracy  available  from  a  single  sensor  is  sufficient  to  satisfy 
the  needed  requirements,  then  it  is  sensible  to  use  it. 

As  it  is  mentioned  before,  the  positional  correlation  process  is  a 
prerequisite  for  the  fusion  of  identity  and  behavioural  information.  As 
noted  earlier  these  types  of  information  are  of  little  practical  use  unless  they 
are  associated  with  position.  When  we  correlate  kinematic  information  we 
deal  with  dimensional  data  to  which  we  can  apply  recognized  mathematical 
tests  for  correlation.  However,  identity  and  behaviour  cannot  be  treated  the 
same  way.  Expert  systems  can  be  used,  in  which  inference  is  performed 
using  both  sensor  data  and  rules.  Their  structure  allows  the  utilization  of 
fully  different  kinds  of  information  regardless  of  its  form.  This  means  that 
each  information  source  is  allowed  to  contribute  information  at  its  own  level 
of  detail.  They  process  heuristic  knowledge,  apply  logical  inference,  and 
reason  with  the  human  knowledge  stored  in  the  computer.  Rather  than 
facts,  this  knowledge  represents  the  human  rules  of  thumb'  stored  non- 
proceduraliy  in  the  knowledge  domain.  Like  humans  they  can  reason  about 
uncertain  situations,  factoring  degrees  of  uncertainty  into  the  reasoning 
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process.  Certainty  factors  can  be  assigned  and  carried  throughout  the 
reasoning  process,  then  reflected  in  the  advice  the  expert  system  derives. 
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FIGURE  7.6.  An  Expert  System  Architecture  For 

Track  Fusion 
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VIII.  THE  SIMULATION  SCENARIO 


A.  EXPERTS'  VIEW  OF  MODERN  RADAR  ENVIRONMENT 

Technically,  computers  are  concerned  only  with  those  radar  signals  which 
relate  to  responses  from  aircrafts,  and  it  is  therefore  unnecessary  and 
uneconomic  to  feed  them  with  ail  of  the  unwanted  echoes  which  occur  as  a 
radar  antenna  sweeps  through  360°,  out  to  the  range  of  its  transmitted 

power.  To  be  able  to  achieve  this  situation  the  radar  signals  of  the 
concerned  targets  require  to  be  converted  into  a  digitised  format.  In  this 
form  the  radar  signals,  radar  data,  or  radar  information  can  be  fed  into  a 
computer,  to  be  either  displayed  directly  onto  a  radar  display  or,  if  the 
computer  has  the  capacity  or  is  linked  to  an  additional  computer  containing 
the  relevant  flight  plans  and  information,  then  both  types  of  information  can 
be  correlated  before  being  further  processed  onto  the  operator  s  display.  A 
further  key  factor  in  this  correlation,  is  the  allocation  of  identity  codes  to 
individual  aircrafts,  through  the  use  of  Identification  of  Friend  or  Foe  (IFF) 
for  military  aircrafts,  or  a  Secondary  Surveillance  Radar  (SSR)  for  civilian 
aircrafts.  Also,  there  is  the  ability  to  enables  those  aircrafts  which  do  not 
carry  a  transponder  (SSR  or  IFF)  to  appear  on  the  operator  s  display  with  a 
computer 'related  indication  of  their  identity. 

By  allocating  to  the  concerned  aircraft  a  discrete  IFF  or  SSR  code,  and  also 
by  informing  the  computer  that  the  code  so  allocated  refers  to  the  specific 
flight  information  relevant  to  the  aircraft,  the  computer  is  able  to  recognize 
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the  radar  information  which  it  receives,  and  to  correlate  this  information 
with  the  flight  information  already  in  its  possession.  The  act  of  relating 
flight  information  and  radar  information  begins  to  open  up  wide  horizons  for 
the  application  of  automation  to  the  tasks  of  controlling  the  air  space.  The 
operator  himself  acts  as  a  communicator,  a  navigator,  a  calculator  and  a 
predictor  of  future  events.  It  is  essential  to  recognise  that  the  advantages  in 
automation  are  in  reducing  the  workload  upon  the  operator  and  be  applied 
primarily  to  those  of  the  operator  s  functions  which  limit  his  capability  to 
discharge  his  primary  responsibility,  which  is  that  of  a  decision- maker. 

As  an  example  of  the  standard  format  of  the  type  of  information  of  the 
flight  plan  which  are  in  general  use  are: 

1.  Aircraft  type; 

2.  Aircraft  callsign; 

3.  SSR  or  IFF  code; 

4.  Aerodrome  of  departure; 

3.  The  aerodome  of  destination; 

6.  The  proposed  route  of  flight; 

7.  Estimated  Departure  Time  (EDT); 

8.  The  estimated  time  at  the  Flight  Information  Region  (FIR)  boundaries; 

9.  Height  or  desired  cruising  level; 

10.  Aircraft  s  cruising  speed; 

1 1.  Type  of  flight  (e.g.  military /scheduled/generai  aviation); 

The  radar  viewing  units  have  also  changed  dramatically  from  the  original 
cathode  ray  tube.  It  is  usual,  in  modern  radar  units  to  which  automation  is 
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being  applied,  to  use  a  synthetic  type  of  radar  display.  A  modern  radar 
display  oonsole.  adds  the  following  facilities  to  the  conventional  radars: 

a.  Position  symbols  and  labels  adjacent  to  position  symbols  of  selected 
aircrafts  displaying  important  flight  plan  information. 

b.  Trail  dots,  which  appear  behind  the  aircraft  s  symbol  to  indicate  the 
track  which  it  has  been  following. 

c.  Visual  alarms  for  an  aircraft  emergency,  special  hi-jack  code,  or  radio 
faliure,  these  are  in  the  form  of  flashing  symbols  and  labels. 

d.  Tabular  areas,  these  are  areas  upon  which  can  be  displayed  any 
information  of  interrst  to  the  operator. 

e.  Synthetic  map  displays,  such  as  the  outline  of  airways,  and  air-routes, 
the  coast  lines,  danger  areas,  etc.  These  maps  are  usually  programmed 
within  the  consol's  computer  memory.  Also  there  are  facilities  which 
exist  for  the  operator  to  draw- in  on  his  display  a  synthetic  map  for  any 
special  purpose,  such  as.  for  example,  a  military  exercise  area  or  a 
temporary  prohibited  area  for  an  air  display. 

Figure  8.1,  provides  some  idea  of  the  type  of  information  which  can  be 
presented  on  TWS  radar  displays  as  applied  to  modern  ATC  systems  (Ref. 

78, 79, 80,8 1 1.  For  example,  for  satisfying  the  requirements  of  modern  ATC 
systems,  which  must  have  instantly  available  information  that  is  both 
accurate  and  reliable,  the  aircraft  itself  is  able  to  co-operate  with  the  ground 
based  radar  systems.  That  is.  it  can  carry  its  own  airborne  equipment, 
known  as  a  transponder',  which  is  capable  of  communicating  with  the 
ground-based  SSR  system.  The  transponder  is  activated  by  pairs  of  pulses 
transmitted  by  a  ground  interrogator,  and  its  reaction  is  to  transmit  a  train 
of  pulses  on  a  different  radio  frequency  to  the  SSR  interrogator  receiver  on 
the  ground.  Because  the  transponder  is  not  relying  upon  reflected  energy 
from  the  aircraft  to  provide  a  radar  echo,  but  is  making  a  full-blooded  reply 


150 


itself,  this  enables  the  transmitters  on  the  ground  to  be  of  lover  pover  and 
employ  simpler  and  cheaper  technology  and  also  ensure  a  certainty  of  signal 
return,  unaffected  by  veather  or  other  clutter  factors.  Also  the  returning 
train  of  pulses  from  the  aircraft  can  be  coded  to  contain  information 
pertinent  to  that  specific  aircraft  such  as.  for  example,  the  identity  of  the 
aircraft  and  the  height  at  which  it  is  flying.  This  factor  gives  the  SSR 
receiver  and  its  computer  processor  the  ability  to  separate  and  identify 
different  targets  in  a  manner  that  the  ordinary  radar  cannot  do.  and  then  be 
able  to  compute  additional  information  such  as  the  speed  of  the  aircraft  and 
its  flight  attitude,  all  without  recourse  to  any  radio  telophony  speech  with 
the  pilot,  other  than  an  initial  request  to  select  a  special  group  of  code 
numerals  on  his  SSR  select  panel  in  the  cockpit. 
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Target  trait  added 
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Figure  5.1.  Modern  ATC  display  vitli  SSR  Information 
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B.  THE  SCENARIO 

The  HEA  approach  is  going  to  be  applied  on  three  different  TWS  systems 
A.  B.  and  C  These  systems  are  assumed  to  be  tied  together  by  appropriate 
direct  communication  links  as  shown  in  Figure  8.2.  Each  system  has  detected 
two  different  targets  using  its  own  radar  sensor.  The  local  data  of  each 
system  gathered  by  its  own  radar  sensor  have  been  processed  using  its  local 
estimation  process,  resulting  in  two  well  confirmed  and  distinct  tracks.  So. 
sii  tracks  will  result  from  the  local  estimators  of  the  three  TWS  systems. 
These  tracks  are  Al.  A2.  Bl.  B2.  Cl.  and  C2  from  TWS  systems  A.  B.  and  C 
respectively.  The  corresponding  targets  for  these  tracks  are  al.  a2.  bl.  b2. 
cl.  and  c2  for  tracks  Al,  A2.  Bl.  B2.  Cl,  and  C2  respectively.  Using  Equation 
(2.3).  the  network  shown  in  Figure  8.2,  is  represented  as 
G  -  (V.  E) 

where  V  -  (A.  B.  C) 

and  E  -  ( IA.  B).  [A.  Cl.  (B,  Cl ) 

The  TWS  systems  together  are  assumed  to  cover  a  large  air-space  area 
with  a  partially  overlapping  fields  of  view.  Each  system  is  going  to  use 
locally  its  track  information  resulting  from  its  local  estimator  (as  that 
discussed  in  Chapter  V) ,  in  addition  to  sending  it  to  the  the  other  two 
systems  via  the  communication  link  used  to  tie  each  of  the  two  other 
systems  with  the  local  one.  By  this  way  the  area  of  coverage  of  each  system 
is  extended  to  cover  the  whole  area  covered  by  the  three  systems.  This 
means  that  each  system  may  get  track  Information  pertaining  targets 
beyond  the  coverage  of  its  local  radar  sensor.  This  can  give  the  operator  in 
each  system  s  site  an  advance  details  on  aircrafts  which  are  due  to  enter  his 
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sector  of  responsibility  which  seems  to  be  of  great  help  especially  in  ATC 
systems  and  Q  systems.  It  is  assumed  that  the  suitable  network  access 
protocol  which  secure  error  free  exchange  of  track  information  is  used,  and 
the  needed  coordinate  transformation  and  the  referring  to  the  common  time 
instant  as  discussed  in  Chapter  IV  is  done. 

It  is  also  assumed  that  the  type  of  modern  radar  display  console  like  that 
shown  in  Figure  8.1 ,  is  used  at  each  TVS  system.  The  fusion  process  is  going 
to  be  performed  in  TVS  system  A  based  on  its  local  track  information  and 
the  other  track  information  sent  to  it  from  TWS  systems  B  and  C.  A  pairwise 
correlation  process  is  used  to  correlate  each  of  the  tracks  A 1 .  and  A 2  with 
the  other  tracks  Bl.  B2.  Cl.  and  C2.  The  correlation  process  is  going  to  be 
based  on  the  kinematic  information  of  the  last  report  of  each  track  (range, 
bearing  and  speed),  in  addition  to  its  identity  code,  intent  and  behaviour.  A 
simplified  expert  system  approach  as  outlined  in  Chapter  VU.  using  EXSYS 
expert  system  development  package  is  used  to  perform  the  fusion  process. 
The  correlation  process  (embeded  in  the  fusion  process)  will  be  basically 
started  based  on  the  position  and  velocity  closeness  of  each  two  tracks  by 
thresholding  the  absolute  value  of  the  difference  between  their  position  and 
velocity  to  a  certain  threshold  value.  This  threshold  value  could  be  chosen 
based  on  the  error  covariance. 

Two  expert  systems.  TRAFUS1  and  TRAFUS2  are  developed.  TRAFUS1  is 
a  kind  of  an  experimental  hard  decision  knowledge-based  track  fusion 
approach.  TRAFUS2  is  a  kind  of  an  experimental  soft  decision  knowledge- 
based  track  fusion  approach.  In  these  expert  systems,  simple  inference  rules 
are  used  to  perform  the  correlation  process  and  eliminate  obviously 
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impossible  pairing  of  tracks.  The  rules  are  allowed  to  be  easily  modified, 
added  or  deleted.  New  rules  entered  can  be  checked  against  the  existing 
rules  for  consistency. 

TRAFUS1  and  TRAFUS2  obtain  data  needed  to  make  a  decision  by  asking 
the  user  questions  relevant  to  the  tracks  needed  to  be  fused.  The  user  can 
also  ask  how  the  expert  system  reaches  a  decision.  These  two  features  are 
very  helpful  in  training  novice  operators  in  real  life  applications.  Almost  no 
training  is  required  to  run  any  of  them  or  any  already  developed  expert 
system  using  EXSYS  expert  system  development  package.  Details  of 
TRAFUS1  and  TRAFUS2  are  presented  in  Chapter  IX  and  Chapter  X 
respectively.  Their  rules  are  described  in  Appendix  £  and  Appendix  F 
respectively. 
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ffluiitioo  Scenario 


IX.  TRAfUS.l 


A.  RUNNING  TRAFUS1 

TRAPUS(  from  TRAck  FUSion),  refers  to  an  experimental  system  to 
explore  the  applicability  of  artificial  intelligence  techniques  to  the 
implementation  of  an  automated,  extremely  flexible  track  fusion  consultant. 
To  run  TRAFUS1,  the  user  enters  EXSYS  TRAFUS1  or  just  EXSYS  and  he  will 
be  asked  for  the  filename  which  will  be  TRAFUS1.  If  TRAFUS1  is  on  other 
than  the  default  drive,  then  he  should  enter  the  drive  designator  with  the 
filename.  For  example.  EXSYS  BrTRAFUSl.  Then  he  will  be  asked  if  he 
wishes  instructions  on  how  to  use  the  program.  After  that  he  selects  if  he 
wishes  to  have  rules  displayed  as  the  program  runs.  Then  he  will  be  asked 
questions  relevant  to  the  subject.  The  user  answers  by  selecting  one  or  more 
answers  from  a  list  or  entering  a  numeric  value.  The  expert  system  will 
oontinue  to  ask  questions  until  it  has  reached  a  conclusion.  The  conclusion 
may  be  the  selection  of  a  single  solution  or  a  list  of  possible  solutions 
arranged  in  order  of  likelihood.  TRAFUS1  can  explain,  in  English,  how  it 
arrived  at  its  conclusion  and  why.  If  possible,  the  program  will  derive 
information  from  other  rules  rather  than  asking  the  user.  This  ability  to 
derive  information  allows  the  program  to  combine  many  small  pieces  of 
knowledge  to  arrive  at  logical  conclusions  about  complex  problems.  The 
rules  editor  of  EXSYS  allows  the  rules  to  be  easily  modified,  added  or  deleted. 
All  knowledge  base  files  for  TRAFUS1  are  kept  in  two  parts:  one  with  a  .RUL 
filename  extension  and  one  with  a  .TXT  filename  extension.  Both  must  be  on 


137 


viil  start  asking  the  user  questions  relevant  to  the  subject  area  of  the 
knowledge  base.  This  is  how  the  program  obtains  the  data  needed  to  make  a 
decision.  There  are  two  types  of  questions  he  may  be  asked:  multiple  choice 
and  numeric  value. 

Multiple  choice  questions  will  display  a  statement  ending  in  a  verb, 
followed  by  a  numbered  lJst  of  possible  completions  of  the  sentence.  The 
user  should  enter  the  number  or  numbers  of  the  choices  correct  for  his 
situation  and  press  the  (ENTER)  key.  if  more  than  one  number  is  chosen,  the 
numbers  should  be  separated  with  a  space  or  a  comma.  If  numbers  outside 
the  range  of  the  list  are  entered,  the  program  will  re-ask  the  question.  In 
fact  the  user  won  t  get  past  the  question  until  he  answers  it. 

The  other  type  of  information  the  user  may  be  asked  for  is  a  numeric 
value.  There  will  be  an  explanation  of  what  information  the  program  needs 
and  a  space  to  inter  the  value  by  typing  it  and  pressing  (ENTER).  The 
number  can  include  a  decimal  point. 

The  expert  system  will  oontinue  asking  questions.  When  the  program  has 
obtained  enough  information  to  determine  that  all  the  IF  conditions  in  a  rule 
are  true,  it  will  display  the  rule,  (unless  the  user  has  opted  to  not  have  rules 
displayed  as  they  are  used),  if  the  computer  determines  that  any  of  the  IF 
conditions  in  a  rule  are  false,  it  will  reject  the  rule  and  go  to  the  next 
appropriate  rule.  An  example  of  the  results  obtained  from  a  run  of  TRAFUS 1 
is  presented  in  Figure  9.1. 
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FIGURE  9.1.  Results  Obtained  from  a  Run  of  TRAFUSI 
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B.  ASKING  ABOUT  RULES 


When  «  rule  is  displayed  the  user  has  the  option  of  asking  how  TRAFUS1 
knows  a  condition  in  the  IF  part  is  true.  To  do  this  he  enters  the  line 
number  of  the  IF  condition.  The  expert  system  will  respond  with  one  of  four 
responses: 

1.  The  expert  system  will  display  the  rule  or  rules  that  allowed  it  to  derive 
the  information.  A  rule  used  for  derivation  will  have  information  about 
the  condition  the  user  are  asking  about  in  its  THEN  part.  He  can  then 
continue  asking  how  the  expert  system  knew  that  rule's  IF  conditions 
were  true  and  so  on.  If  the  user  asks  about  a  condition  that  is  an 
algebraic  expression,  the  values  of  each  of  the  variables  in  the 
expression  will  be  displayed.  He  may  then  asks  how  these  values  were 
derived  by  entering  the  number  of  the  variable. 

2.  If  the  user  asks  the  expert  system  how  it  knows  a  condition  is  true 
that  it  did  not  derive,  but  determined  by  asking  him  for  input,  it  will 
respond  that  he  told  it  the  information  was  true. 

3.  The  user  can  ask  for  information  about  a  condition  that  is  several 
conditions  down  in  the  list  and  which  the  expert  system  may  not  have 
yet  tested.  This  can  occur  when  the  user  asks  the  expert  system  WHY  in 
response  to  its  question.  If  this  is  the  case,  the  program  will  respond 
that  it  does  not  yet  know  if  the  condition  is  true  or  not. 

4.  In  certain  situations  where  the  expert  system  has  just  derived  new 
information,  it  may  tell  the  user  that  the  condition  he  is  asking  about  is 
false  and  the  rule  will  be  eliminated. 

Rules  may  have  references  for  the  source  of  the  knowledge  (  e.g.  personal 
observation,  book,  article,  etc.).  If.  when  a  rule  is  displayed,  the  user  presses 
(R1  the  expert  system  will  display  the  reference  for  the  rule  if  it  has  one. 
When  the  user  are  finished  examining  the  rule,  by  pressing  (ENTER!,  the 
expert  system  will  continue  asking  him  questions. 
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C  USING  "WHT 

If  the  user  wonders  why  the  expert  system  needs  to  know  the 
information  it  is  requesting,  he  can  ask  it  by  typing  WHY.  instead  of  making 
a  selection  from  the  list  of  values,  and  presses  the  {ENTER!  key.  The  expert 
system  will  respond  by  displaying  the  rule  it  is  trying  to  determine  the 
validity  of.  He  may  now  ask  the  expert  system  about  the  IF  conditions  or 
references  as  described  before.  Then  he  presses  (ENTER)  when  he  is  finished 
examining  the  rule.  The  expert  system  may  now  have  the  question 
originally  asked  redisplayed  or  it  may  display  another  rule.  If  the  later  is 
the  case,  it  is  because  the  first  rule  displayed  was  being  used  only  to  derive 
information  needed  by  the  second,  and  the  second  is  the  rule  actually  being 
tested.  TRAFUS1  will  continue  showing  the  rules  it  is  using  to  derive 
information  until  it  reaches  the  base  rule  it  is  trying  to  apply.  This  rule  will 
have  at  least  one  choice  in  its  THEN  part  By  pressing  the  (ENTER)  key.  the 
program  will  be  continued.  If  more  than  one  rule  was  displayed,  each  time 
the  user  presses  (ENTER)  he  will  go  one  rule  up  the  list  being  used  in  the 
derivation.  He  will  then  be  reasked  the  question  he  responded  to  with 
•WHY". 

D.  ASKING  HOW  A  CONCLUSION  WAS  REACHED 

The  user  can  ask  the  expert  system  how  it  arrived  at  its  final  value  for  a 
specific  choice  or  why  a  statement  is  displayed.  If  he  enters  the  line  number 
for  any  choice  or  statement.  TRAFUS1  will  respond  by  displaying  ail  of  the 
rules  it  used  to  determine  the  value  of  that  choice  or  statement.  He  again 
has  ail  of  the  options  in  requesting  more  information  about  each  of  the  rules 
as  discussed  above.  If  he  wishes  to  learn  why  a  choice  not  displayed  was 
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eliminated  by  being  given  a  probability  value  of  0.  be  presses  (A)  to  have  all 
choices  displayed.  Then  he  enters  the  line  number  of  the  choice  in  question. 

B.  CHANGING,  RERUNNING  AND  PRINTING  THB  DATA 

The  user  can  easily  test  and  analyze  the  effect  his  input  had  on  the  final 
outcome.  He  can  change  one  or  more  of  his  answers,  while  holding  the 
remainder  constant,  reruns  the  data  and  sees  what  effect  the  changes  have 
on  the  final  outcome.  The  current  value  for  the  choices  can  be  saved  for 
comparison  with  the  new  values.  To  change  the  data  he  presses  (C).  He  will 
be  asked  if  be  wishes  to  save  the  current  values  for  comparison  with  the 
new  ones  he  will  be  calculating.  TRAFUS1  will  then  display  a  list  of  all  of  the 
information  he  the  user  provided  by  answering  questions.  Then  he  enters 
the  number  of  the  statement  he  wants  to  change  and  the  expert  system  will 
reask  that  question.  By  answering  the  question  with  the  new  values  that  he 
wishes  to  try.  the  program  will  return  to  the  display  of  all  of  the  information 
that  he  told  it.  The  user  continues  changing  statements  until  the  data  is  the 
way  he  wants  it,  then  presses  (R)  to  rerun  the  data.  If,  due  to  the  changes, 
the  program  realizes  that  it  needs  more  information,  it  will  ask  for  it.  The 
rules  will  not  be  displayed  during  the  rerun.  The  program  will  then  display 
the  new  list  of  choices.  If  he  opted  to  have  the  previous  values  for 
comparison,  they  will  be  displayed  in  parenthesis. 

He  can  change  the  data  again  in  almost  the  same  way.  The  only 
difference  is  that  when  he  presses  (CJ  he  will  be  given  three  options: 

1.  Keep  the  original  values  for  comparison. 

2.  Keep  the  most  recently  calculated  values  for  comparison. 

3-  Don't  keep  any  comparison  data. 
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The  ability  to  change  and  rerun  the  data  allows  the  user  to  test  the  expert 
system  and  see  if  an  answer  that  he  were  not  sure  of  is  vital  to  the  final 
outcome,  or  really  has  little  effect.  He  can  save  a  printed  copy  of  the  results 
of  the  run  by  pressing  (PI.  Then  he  will  be  asked  if  he  wishes  to  have  the 
data  he  told  the  expert  system  also  printed.  If  he  presses  [Y]  he  will  have 
both  the  final  sorted  list  of  choices  printed  along  with  all  of  the  data  he 
provided  the  expert  system  in  answer  to  its  questions. 

F.  SAVING  DATA  AND  RESULTS 

The  user  has  the  option  of  storing  the  data  he  has  input  into  TRAFUS1, 
exiting  the  program,  and  being  able  to  return  to  the  same  point  later.  This 
can  be  useful  if  he  needs  to  look  up  information  needed  by  TRAFUS1  or  if  he 
must  leave  the  program  but  don't  want  to  loose  the  data  he  has  already 
input.  He  can  select  to  store  the  data  by  entering  QUIT  in  response  to  any  of 
the  program  requests  for  data.  The  program  will  then  ask  for  the  name  of 
the  file  to  store  the  data  in.  A  filename  of  up  to  8  characters  (  different  than 
TRAFUS1 )  is  needed  to  be  entered.  Then  he  will  be  asked  if  he  wishes  to 
return  to  the  program  or  exit  to  DOS.  Also  he  can  store  the  input  provided  to 
reach  the  conclusions  by  pressing  [Ql.  This  is  the  same  as  using  the  QUIT 
option  when  entering  data.  The  data  input  will  be  stored  in  a  disk  file  and 
he  will  be  able  to  return  directly  to  this  point.  This  is  particularly  useful  if 
he  wants  to  experiment  with  the  "  change  and  rerun"  command. 
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X.IRAEUS2 


So  far,  in  TRAFUS1.  for  the  representation  of  facts  and  of  methods  for 
deducing  new  facts  from  old  ones,  we  have  almost  always  assumed  that 
either  a  fact  is  known  to  be  true,  or  it  is  knowa  not  to  be  true  (or  nothing  at 
all  is  known  about  it).  We  have  essentially  not  considered  the  possibility 
tnat  we  might  know  something  that  is  -  probably  true  ".  However,  there  are 
situations  in  which  such  knowledge  is  important. 

In  TRAFUS2,  An  ad  hoc  approach  is  used,  using  the  EXSYS  0-10  system. 
The  systems  uses  numerical  assignments  to  assertions,  which  expressing  the 
degree  of  belief  of  the  expert  in  these  assertions.  It  is  considered  as  a 
certainty  factor  which  expresses  the  degree  of  confidence  about  a  certain 
hypothesis.  So,TRAFUS2  is  generally  similar  to  TRAPUS1,  but  the  only 
difference  is  that  the  value  following  the  "probability-  '  is  a  ratio  where  the 
denominator  is  10.  This  is  the  most  practical  system.  0/10  is  equivalent  to 
"certainly  false"  and  locks  the  value  at  0/10  regardless  of  any  other  value 
the  choice  may  have  received.  A  value  of  0/10  eliminates  the  choice  from 
further  consideration.  A  value  of  10/10  is  equivalent  to  certainly  true  '  and 
also  locks  the  value  for  the  choice  at  10/10  regardless  of  any  other  values 
the  choice  may  have  received.  Values  of  1  to  9  represent  degrees  of 
certainty  ranging  from  "very  probably  false"  to  "very  probably  true".  The 
values  from  1  to  9  do  not  lock  the  value  and  are  averaged  to  give  the  final 
value  for  a  choice. 

For  example,  if  a  choice  appears  in  three  rules  that  had  true  IF  parts  with 
values  of  3/10,  8/10,  and  <4/10,  the  final  value  for  the  choice  will  be  the 
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average:  3/10.  If  the  values  found  were  3/10, 9/10  and  0/10,  the  0/10 
would  prevail  and  result  in  a  final  value  of  0/10  regardless  of  other  values. 
Likewise,  if  the  values  were  1/10,  3/10  and  10/10,  the  10/10  would  lock 
the  value  at  10/10  regardless  of  the  previous  lower  values.  Values  of  1  -9 
are  averaged  to  a  final  value  only  if  not  over-ridden  by  a  0/10  or  10/10. 

The  first  0/10  or  10/10  prevails  and  will  not  be  changed  even  by  another 
10/10  or  0/10. 

In  developing  TRAFUS2,  several  questions  have  arisen  and  needed 
answers,  among  these  questions  are: 

1.  How  to  convert  from  human  terms  to  numeric  certainty  factors.  For 
example,  what  does  1  It  is  very  likely  that  "  mean? 

2.  How  to  normalize  across  different  people's  scales,  particularly  if  the 
solution  to  question  1  is  to  get  people  to  provide  numbers  directly. 

3.  How  far  to  propagate  changes  in  the  Confidence  Factor  (CF)  on  the  basis 
of  new  evidence.  If  the  CFlh  ,e)  changes  very  slightly  and  h  is  part  of 
the  relevant  evidence  for  another  hypothesis,  h.  should  CFth,e]  also  be 
changed?  ir  very  tiny  changes  are  always  propagated  as  far  as  possible, 
the  system  may  spend  all  of  its  time  doing  that  with  very  little  impact 
on  the  final  outcome.  On  the  other  hand,  many  small  changes  can  add 
up  to  a  significant  change  that  should  not  be  ignored. 

4.  How  to  provide  feedback  to  the  database  to  improve  the  accuracy  of  the 
CP  s  of  the  rule*.  This  problem  has  been  particularly  solved  in  MYCIN 
by  the  TEIRE5IAS  system's  ability  to  explain  the  reasoning  process  to  a 
physician  and  then  to  accept  statements  from  the  physician  about  how 
the  rules  should  be  revised. 

For  the  solution  of  problems  1,  2,  and  3.  the  word  description  of  certainty 
shown  in  Figure  10.1,  is  used.  For  solving  problem  4,  the  capability  of 
EDITXS  in  EXSYS  is  used.  Figure  10.2,  shows  the  results  obtained  from  a  run 
of  TRAFUS2. 
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XI.  CONCLUSIONS 


In  this  dissertation,  a  new  efficient  and  reliable  distributed  estimation 
architecture  for  Distributed  Sensors  Networks  (DSN)  has  been  presented.  It 
is  called  Horizontal  Estimation  Architecture  (HEA).  HEA  is  introduced 
bearing  in  mind  the  technological  advances  in  several  disciplines,  which  are 
providing  the  future  tools  for  designers  of  DSN,  especially,  the  application  of 
artificial  intelligence  and  knowledge  manipulation,  and  the  adoption  of 
decentralized  decision  making  strategies  in  complex  technological 
environments.  In  addition,  the  communication  load  is  minimized  between 
different  nodes  of  a  network  by  exchanging  locally  processed  data  between 
these  nodes  instead  of  raw  data.  Also,  the  HEA  is  developed  bearing  in  mind 
the  possibility  of  any  hostile  enemy  actions.  Including  physical  destruction 
and  electronic  countermeasures  which  can  create  node  and  link  failures  and 
a  dynamically  changing  network  topology  which  are  essential  requirements 
for  military  systems. 

A  great  motivation  to  HEA  is  the  applicability  of  partitioning  approaches 
which  allow  any  large  complex  system  to  be  divided  into  manageable 
proportions.  The  partitioning  allows  the  usage  of  microcomputer  systems, 
which  provide  a  cost-effective  solution  for  data  processing.  Obvious 
advantage  of  HEA  as  applied  to  DSN  are  local  autonomy,  heterogeneous 
feature,  reliability,  and  survivability.  Network  splitting  and  reformation  or 
connection  of  additional  compatible  networks  are  practicable  during  system 
operation  and  do  not  cause  any  restrictions  as  the  new  system  is 
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initiated.  The  main  concepts  and  features  of  HEA  were  presented  in  two 
conferences  (Ref.  82, 831. 

Through  the  integrated  view  of  the  assets  of  a  DSN  as  shown  in  Figure  2.4. 
the  HEA  has  been  applied  to  TWS  radar  systems.  This  application  shows  the 
effect  of  the  partitioning  approach  used  by  solving  the  multitarget  tracking 
problem  at  the  sensor  inode)  level  using  the  appropriate  local  estimation 
algorithm.  This  guarantees  the  maximum  utilization  of  the  local  resources  of 
each  radar  site  and  takes  advantage  of  the  aovanced  techniques  taking  place 
in  each  major  block  of  a  radar  system.  When  the  multitarget  tracking 
problem  is  solved  separately  for  each  individual  node,  a  somewhat 
redundant  view  of  the  surveillance  area  will  result,  depending  on  the  degree 
of  overlapping  coverages  between  the  radars.  The  output  of  the  local 
estimator  is  a  group  of  different  tracks.  By  solving  the  multitarget  tracking 
problem  separately  for  each  individual  sensor  (node),  the  track  estimates  for 
the  targets  in  the  surveillance  area  become  consolidated  first  at  the  level  of 
each  individual  node. 

A  major  component  of  HEA  is  the  information  fusion  process.  The 
information  fusion  process  decides  whether  more  than  one  track  from 
different  nodes  represent  the  same  target.  A  pairwise  correlation  technique 
is  used  for  and  proved  to  be  easy  to  implement.  The  corresponding 
consistent  tracks  are  combined  together.  Two  techniques  are  used  for 
information  fusion.  The  first  is  based  on  algorithmic  processing  of  track 
kinematic  information,  and  it  is  proved  that  the  fused  estimates  can  be 
considered  the  global  estimates  of  the  tracks  in  the  overlapping  area 
assuming  that  the  local  estimates  are  optimal.  It  is  also  proved  that  there  is 


169 


no  need  for  the  calculation  of  the  cross  covariance  of  the  fused  tracks  since 
the  local  estimates  and  its  associated  covariance  are  the  kind  of  information 
exchanged  between  different  nodes.  The  second  technique  is  based  on 
heuristic  reasoning  by  using  expert  systems  to  encapsulate  target  identity, 
behavior,  intent  and  human  expertise  in  computer  software.  An  EXSYS 
expert  system  development  package  is  used  for  this  purpose.  EXSYS 
employs  A1  techniques  using  currently  available  hardware  and  software.  It 
does  not  require  the  complexity  and  cost  of  LISP  driven  architectures,  nor  is 
there  is  a  need  for  large  on-site  support  staff.  The  developed  expert  systems 
can  be  used  mainly  as  an  advisory  tool  for  the  manual  operator.  With 
modifications,  it  may  directly  control  the  fusion  process  autonomously. 

Using  the  expert  system  approach,  the  correlation  process  can  be  easily 
implemented  using  simple  production  rules.  The  emphasis  in  the  programs 
TRAFUS1  and  TRAFUS2  has  been  on  the  fusion  of  track  information,  but 
fusion  processes,  especially  in  military  applications,  must  integrally  overlap 
with  planning.  ECM  effects,  tactical  doctrine,  operational  limitations,  logistics 
and  historical  reconstruction/analysis  processes.  The  modification  and 
augmentation  of  these  into  expert  systems  can  be  done. 

Further  research  is  needed  for  the  application  of  HEA  in  systems  which 
have  different  kinds  of  sensors.  Also,  many  different  kinds  of  knowledge 
engineering  approaches  are  being  applied  to  the  various  facets  of 
information  fusion  problems.  The  data  structures  of  the  applicable  expert 
systems  vary  greatly,  and,  in  general,  talk '  among  these  systems  has  not 
occurred.  A  considerable  research  effort  is  needed  to  establish  a  common 
ground  for  these  systems  to  enable  them  to  communicate  with  each  other. 
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Another  consideration  about  developing  an  expert  system  is  tacit 
knowledge.  Tacit  knowledge  has  implications  for  knowledge  elicitation 
within  the  current  state  of  the  art.  One  implication  arises  out  of  the 
invisibility  of  the  relation  between  formal  knowledge  and  skilled 
accomplishments,  our  lack  of  awareness  of  our  own  skills,  and  our  frequently 
misplaced  respect  for  theory-like  representation  of  what  we  believe  we 
know.  The  only  solution  to  the  problem  demands  that  the  knowledge 
engineer  must  do  more  than  up  the  knowledge  of  the  expert,  but  must 
undertake  at  least  a  short  apprenticeship,  a  period  of  participation  as  an 
observer,  as  part  of  the  elicitation  process. 
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APPENDIX  A 

THE  SPECTRUM  OF  SENSORS  AVAILABLE  FOR  DATA  FUSION 


Detectable 

Characteristics 

Spectral 

Range 

Sensor  Systems 

Acoustic 

Frequency 

1  Hz  -  10  kHz 

Acoustic  Detectors 
Active/Passive  Sonar 
Seismometers 

Electromagnetics 

Radio 

FrequencytRF) 

1  Hz  - 1  MHz  (LF) 

10  MHz  -  100  MHz 
(HF/VHF/UHF) 

1  -  10  GHz 

10-30  GHz  (SHF/EHF) 

30  -  300  GHz(MMW) 

Magnetometers 
Passive  ESM 

Receivers 

Radar  (Monostatic, 
Multistatic ) 

-  Surveillance 

-  Fire  Control 
Millimeter  Radar 
Radiometers 

InfraRed 
Wavelength  (IR) 

U) 

o 

• 

□ 

IR  Radiometers 
-Scanning  IR  Search 
Track 

-Focal  Plane  Arrays 

Visible  Light 
Ultraviolet  ( UV) 

0.7  -  0.4  □ 

0.-4  -  3x  10-3  □ 

Lazer  Radar 
EOSensors(TV) 

UV  Spectrometers 

Nuclear 

Paticies 

3x  102  -  3i  10-4  A 

X  -  Ray  Detectors 
Gamma  Ray  Detectors 

Non  -  Nuclear 
Particles 

Mass  Spectrometers 
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ABPEMB1XB 

OVERVIEW  OF  TKISOLVER 


Computer-assisted  problem-  solving  typically  goes  through  the  following 
steps: 

Problem  Mathematical  Model  Algorithm  Program  -4  Results 
The  first  step,  based  on  the  analysis  of  the  problem,  is  to  set  up  a 
mathematical  model  expressing  the  relationships  between  the  variables. 

Then  an  algorithm  (a  precise  description  of  how  a  computation  is  to  proceed) 
must  be  developed  for  solving  specific  problems.  Next  a  program 
implementing  the  algorithm  is  written  in  a  conventional  programming 
language  (e  g.  BASIC.  FORTRAN,  or  PASCAL).  Frequently,  the  programmer 
uses  a  diagram  called  a  ‘  flowchart"  to  facilitate  the  design  and  understanding 
of  the  algorithm  or  the  program  structure.  A  considerable  amount  of  time  is 
spent  in  these  two  stages,  especially  in  the  programming  stage  when  the 
computer  must  be  instructed  in  a  step-  by-  step  manner.  Finally,  of  course, 
the  program  has  to  be  debugged  and  run.  All  this  means  that  a  user  has  to 
think  like  a  computer"  to  solve  a  problem.  This  siphons  a  Jot  of  time  into 
other  tasks  necessary  for  fixing  an  unmatched  parenthesis  or  formatting  the 
output  correctly.  Solving  a  slightly  different  problem  often  means  modifying 
the  program  and  sometimes  even  reworking  the  algorithm. 

TKISoiver  shortcuts  this  problem-solving  process  by  eliminating  two  steps, 
developing  an  algorithm  and  writting  a  program.  Instead,  the  user  inputs 
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the  mathematical  model  directly  using  standard  algebraic  notation.  By  this 
way.  the  user  and  the  computer  interact  at  the  level  of  mathematical  models 
or  relationships  between  variables  and  the  computer  automatically  takes 
care  of  sequencing  the  operations.  So.  TKISolver  lets  the  user  focus  on  the 
problem  itself,  not  on  how  the  problem  is  going  to  be  solved  using  the 
computer.  The  TK"  in  TKISolver  stands  for  Tool  Kit.  implying  that  there  are 
several  problem-solving  tools  in  the  kit  and  more  to  be  expected.  The 
exclamation  mark  refers  to  the  Action  kee,  which  is  pressed  to  solve  a 
problem  or  to  make  other  things  happen.  TKISolver  solves  problems  for 
professionals  working  with  numbers  and  formulas.  It  does  this  by 
processing  equations  entered  in  their  natural  form  [Ref.  841. 

Considered  as  an  expert  system  [Ref.  84).  figure  B.l.  shows  the 
architecture  of  TKISolver.  The  domain  specific  knowledge  responsible  for  the 
high  performance  of  the  system  is  contained  in  the  knowledge  base.  The 
problem  solving  tools  embodying  the  control  strategy,  the  direct  and 
iterative  solvers,  utilize  the  knowledge  base  in  the  process  of  solving 
particular  problems.  For  interaction  (or  I/O).  TKISolver  provides  "sheets" 
displayed  through  one  or  two  windows  on  the  screen.  The  main  feature  of 
the  architecture  is  the  explicit  division  between  the  knowledge  base  and  the 
control  strategy.  Consequently,  the  expert/user  deals  only  with  issues  of 
domain  specific  knowledge,  and  is  insulated  from  the  details  of  the 
implementation  of  the  control  strategy. 

In  the  following  paragraghs,  the  four  components  of  the  knowledge  base, 
the  characteristics  of  a  model  and  the  problem-solving  mechanism  are 
described: 
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1.  Rules:  The  rule  is  the  basic  component  of  the  domain-specific 
knowledge.  It  expresses  the  underlying  mathematical  relationship  in 
terms  of  the  equality  of  left-hand  and  right-hand  side  expressions. 
Equations,  constraints,  or  definitions  may  all  be  represented  as  rules. 
The  set  of  rules  can  be  represented  in  the  form  of  a  network  of 
relationships  called  R-graph  (for  relationships  graph).  A  variable  is 
represented  by  a  node  in  the  R-graph  and  each  subgraph  or  polygon 
corresponds  to  a  rule  in  the  knowledge  base. 

2.  Unit  Conversions:  Units  of  measurement  are  associated  with  most 
measurable  quantities.  Conversions  between  them  are  frequently 
encountered  in  problem  solving  and  have  to  be  defined  in  the 
knowledge  base.  The  unit  conversion  feature  in  TK!  Solver  simplifies 
the  conversion  between  the  different  units  of  measurement. 

3.  User  Functions:  Empiric  relationships  between  sets  of  variables  are 
expressed  in  the  form  of  user-defined  functions. 

4.  Built-In  Knowledge:  Irrespective  of  the  domain-specific  knowledge. 
TKISolver  can  solve  problems  involving  basic  arithmetic  operations  and 
a  large  variety  of  built-in  mathematical  functions.  For  example, 
trigonometric  functions,  hyperbolic  functions,  exponential  function,  root 
function,  natural  and  decadic  logarithms  and  circular  or  inverse 
trigonometric  functions.  A  standard  variety  of  these  is  supplemented 
by  a  few  special  ones  like  ■'element"  for  retrieving  list  components  or 
"apply"  for  associating  empiric  functions  with  arguments. 

5.  Model:  The  model  encompasses  the  first  three  components  of  the 
knowledge  base  in  Figure  B.l.  (rules,  unit  conversions,  user  functions) 
as  contained  in  the  rule,  variable,  unit,  and  user  function  sheets.  In 
more  general  terms,  the  model  can  be  seen  as  a  compact,  high-level 
representation  of  structure,  organization,  and  content  of  the  domain 
knowledge.  The  composition  of  the  model  coupled  with  its  elegant 
internal  representation  allow  for  a  simple  yet  powerful  control 
strategy.  The  model  also  serves  as  a  user-  friendly  guide  during  the 
problem-solving  process.  The  model  usually  reflects  a  certain  part  of 
the  knowledge  base  in  a  particular  discipline.  The  models  may  be 
easily  merged  by  the  subsequent  loading  of  some  or  all  of  the 
knowledge  base  components  into  TKISolver,  in  order  to  create  larger 
models  capable  of  addressing  more  complicated  problems.  There  is 
also  the  concept  of  TkJSolver  packs  or  sets  of  models  from  particular 
disciplines. 
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<  model)  : :  -  <  rule  >(((newline}<  rule  >}) 

This  meins  that  i  model  consists  of  it  least  one  rule;  each  rule  must 
start  on  a  new  line,  and  there  may  be  any  number  of  lines  between  the 
rules. 

<  rule  >  : :  -  <  expression  >-<  expression  >(<  comment  >1 

This  means  that  a  rule  is  represented  by  two  expressions  linked  by  an 
equal  sign  and  followed,  optionally,  by  a  comment. 

<  comment  >  : :  -  "<  character  string  > 

This  means  that  a  rule  comment  consists  of  a  quotation  mark  (”) 
followed  by  a  string  of  characters. 

6.  Problem-Solving  Mechanism:  The  direct  solver  is  the  workhorse  of  the 
problem-solving  mechanism.  In  it  lies  the  grace  and  power  of 
TKISolver.  It  manipulates  the  equations  depending  on  the  problem 
formulation  and  solves  for  the  unknown.  If  an  incosistency  error  or  an 
illegal  operand  is  detected,  the  solution  process  is  terminated,  and  the 
rule  causing  the  problem  is  flagged  with  the  appropriate  error 
message.  Since  the  solution  path  depends  on  the  problem  formulation, 
the  control  strategy  may  be  considered  as  forward  chaining  or  data- 
driven.  Whenever  the  direct  solver  cannot  match  the  nature  and 
complexity  of  a  given  problem,  the  iterative  solver  can  be  used.  The 
heart  of  the  iterative  solver  is  a  modified  Newton-Raphson  procedure 
which  handles  sets  of  simultaneous  linear  and  nonlinear  equations  []. 

It  can  be  either  explicitly  invoked  or  automatically  called  when  the 
direct  solver  fails  to  produce  a  solution. 


PRODUCTION  RULES 


In  most  areas  of  human  decision  making,  the  reasoning  processes  can  be 
modelled  by  rule-based  systems.  A  rule,  known  in  these  applications  as  a 
"production  rule"  or  a  "production",  is  generally  of  the  form 
IP  Fi  and  F2  and . and  Fa  THEN  C 

or  equivalently, 

Fj  fc  F2  & .  k  Fa  -♦  C 

where  Fj  is  a  fact,  an  event,  a  situation,  a  string  of  symbols,  or  a  cause,  and  C 
is  a  conclusion  or  hypothesized  conclusion,  an  action  to  be  performed,  or  an 
effect  Some  of  the  rules  in  a  production  system  represent  the  knowledge  of 
trained  experts,  and  others  provide  system  organization. 

In  addition  to  an  organized  set  of  rules,  a  production  system  must  have  a 
data  base  consisting,  typically,  of  gathered  pieces  of  evidence  which  might  be 
relevant  to  the  condition  in  the  left  side  of  some  rules.  System  organization 
is  provided  by  several  kinds  of  control  mechanisms.  An  evaluation 
mechanism  is  needed  to  evaluate  the  left  side  of  a  rule  based  on  the 
evidence  in  the  data  base.  It  is  desirable  to  have  a  mechanism  for 
augmenting  and  modifying  the  system.  A  production  system  also  needs 
direction  and  weighting  mechanism. 

Figure  Cl.  is  an  illustration  of  a  tree  structure  of  a  very  simple  production 
system.  The  AND  arcs  denote  single  productions  (where  multiple  conditions 
must  be  satisfied  fcr  the  conclusion  to  follow),  and  OR  inputs  are  separate 


productions.  The  direction"  mechanism  of  a  production  system  relates  to 
reasoning  processes,  where  inferring  and  deducing  new  information  from 
evidence  can  be  considered  opposite  in  direction  from  hypothesizing  and 
then  testing  the  hypothesis.  One  type  of  system  direction  is  forward 
chaining  or  running;  these  systems  start  with  input  data  and  proceed  up  to 
conclusions.  Backward  chaining  (running),  or  top  down,  start  with 
hypothesized  conclusions  that  are  selectively  generated  and  proceed  to  see  if 
they  are  supported  by  the  data  base.  Some  systems  use  an  ad  hoc 
combination  of  up  and  down  directions. 

When  using  a  production  system,  there  is  often  associated  with  each  F}  in 

the  premise  a  quantity  known  as  a  certainty  factor",  which  indicates  the 
likelihood  that  Fj  is  true  based  on  the  input  data.  Also,  for  most  production 

rules,  the  premise  leads  to  the  conclusion  with  .  say.  an  $o%  or  90% 
probability,  instead  of  absolutely  true  or  false.  Similarly,  there  may  be 
significant  probability  that  the  conclusion  is  true  even  when  the  premise  is 
not  satisfied.  Measures  of  the  latter  two  likelihoods  are  known  as 
‘  strengths  ',  "attenuation  factors",  or  "oertainty  factors".  All  of  these 
quantities  can  be  used  in  estimating  the  certainty  factor  of  a  conclusion. 
Many  conclusions  are  intermediate  conclusions  that  are  then  treated  as  facts 
for  future  productions.  "Weighting"  is  a  term  that  refers  to  these  quantities 
and  their  propagation  through  the  tree.  Weighting  can  be  used  to  determine 
the  reliability  of  final  conclusions  and  also  to  reduce  the  number  of 
computations  through  the  pruning  of  unlikely  hypotheses.  If  the  statistics 
of  the  process  are  known  sufficiently,  Bayesian  weighting  can  be  used.  A 
more  common  method  of  weighting  is  to  use  ad  hoc  scoring  functions.  When 
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the  conditions  Fj  or  the  evidence  about  them  cannot  be  considered 

independent,  fuzzy  set  theory  can  be  applied.  For  example,  the  fuzzy  set 
computations  P(F|. . .  P„)  -  min  P(F|)  can  be  used  at  the  AND  nodes. 

An  advantage  of  a  production  systems  is  that  it  can  be  designed  to 
provide  high  user  confidence.  The  user  can  read  the  lists  of  rules  and  can 
question  any  conclusion,  and  the  system  can  present  to  him  the  facts  and 
logideading  to  the  conclusion.  If  he  disagrees  he  can  change  the  rules;  with 
an  appropriate  mechanism  for  modification  and  augmentation,  modular 
pieces  of  knowledge  in  the  form  of  production  rules  can  be  added  or  changed 
without  difficulty.  In  automated  fusion  applications,  these  system  attributes 
are  especially  important.  A  user  is  unlikely  to  accept  the  system's  conclusion 
if  does  not  understand  the  logic  behind  it  or  previous  conclusions.  And  he 
must  be  able  to  correct  or  refine  the  system  and  to  incorporate  new 
knowledge  into  the  system  when  changes  occur  in  hostile  force  procedures 
or  equipment. 

Aside  from  the  obviously  difficult  task  of  acquiring  rules,  there  are 
several  special  problems  that  will  be  encountered  in  applying  production 
systems  to  fusion  problems.  At  the  system  front-end  there  is  the  problem  of 
evaluating  the  left  side  of  a  rule  based  on  the  conceptually  structured  data 
obtained  through  the  processing  of  natural  language  reports.  In  platform 
identification  applications,  much  of  the  data  will  be  inaccurate  or  even 
totally  wrong  because  of  deception  or  human  error.  Moreover,  conclusions 
will  often  have  to  be  updated  because  of  the  continual  arrival  of  data.  There 
are  also  the  geometrical  problems  of  track  association  to  be  solved,  but  these 
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Probably  the  greatest  problem  with  production  systems  or  any  automated 
system  is  tbat  there  are  innumerable  nonroutine  situations  which  could 
occur.  Wtile  a  human  might  be  able  to  fuse  the  data  in  an  intelligent  way  in 
many  of  these  situations,  he  probably  would  not  be  able  to  foresee  the 
possibility  of  these  situations  in  time  to  incorporate  the  necessary  knowledge 
into  an  automated  system.  Generally,  it  seems  that  "blackboards"  is  a  proper 
starting  way  for  solving  these  problems. 
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FIGURE  Cl  Trees  of  Conclusion  in  a  Production  System 
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APPENDIX  D 
OVERVIEW  OF  EXSYS 


EXSYS  is  a  generalized  personal  computer  expert  system  development 
packagelRef.  851.  It  works  on  IBM  PC.  XT,  AT  or  compatible.  It  can  create 
about  700  rules,  with  an  average  of  6  or  7  conditions,  per  64k  of  memory 
over  192k.  That  is  about  5000  rules  in  a  PC  with  640k.  The  EXSYS  programs 
are  written  in  the  C  language  producing  small,  fast  running  programs.  The 
EXSYS  development  package  consists  of  four  main  programs: 

EXSYS.EXE:  The  runtime  program  for  running  existing  expert  systen 
knowledge  bases 

EDITXS  JIB:  The  program  for  generating,  editing  and  testing  your  own 
eipert  system  knowledge  bases. 

SHRINK.EXE:  A  utility  program  to  compress  the  size  of  an  edited  knowledge 
base  and  rearrange  the  data  in  a  knowledge  base  for  rapid  access. 
FASTBR.EXE:  A  utility  program  to  rearrange  the  order  of  rules  for 
maximum  speed. 

Eipert  systems  work  with  knowledge  to  arrive  at  conclusions.  This 
knowledge  is  in  the  form  of  rules  that  both  the  user  and  the  computer  can 
understand.  The  set  of  rules  to  solve  a  particular  problem  is  often  referred 
to  as  a  knowledge  base.  A  rule  is  divided  into  five  parts,  an  IF  part,  a  THEN 
part,  an  optional  ELSE  part,  an  optional  NOTE  and  an  optional  REFERENCE. 
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IP 

THIN 

ELSE 

and 

NOTE: 


Condition 

Conditions 

Conditions 

Choices 


REFERENCE: . 

The  IF  part  is  simpiy  a  series  of  conditions,  expressed  as  English 
sentences  or  algebraic  expressions.  The  computer  tests  the  conditions 
against  the  answers  provided  by  a  user  and  information  that  can  be  derived 
from  other  rules,  to  see  if  the  IF  conditions  are  true.  The  THEN  part  is  also  a 
series  of  conditions;  however,  there  can  also  be  choices  with  their  associated 
probability  values.  The  ELSE  part  is  the  same  as  the  THEN  part  but  is 
applied  if  any  of  the  IF  conditions  are  false.  The  ELSE  part  is  optional  and 
usually  not  needed  in  most  rules.  In  some  cases  it  is  desirable  to  add  a  note 
to  a  rule  to  provide  some  special  information  to  the  user.  If  there  is  a  NOTE, 
it  will  be  displayed  with  the  rule.  The  NOTE  does  not  mean  anything  to  the 
program,  it  is  only  for  the  user's  information.  The  developer  of  the  expert 
system  knowledge  base  may  also  add  a  REFERENCE  for  a  rule.  This  is 
intended  to  help  the  user  find  the  source  of  the  knowledge  contained  in  the 
rule  or  more  information  if  they  should  neea  it.  The  REFERENCE  is 
displayed  only  if  the  user  requests  the  reference;  it  is  not  automatically 
displayed  with  the  rule. 

To  understand  how  the  EXSTS  expert  systems  are  generated,  it  i9  needed 
to  understand  the  definitions  of  the  following  terms: 
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1 .  Condition:  In  BXSTS,  there  are  two  main  types  of  conditions,  text  and 
mathematical.  For  text,  the  condition  is  made  up  of  two  parts,  a 
QUALIFIER  and  one  or  mor^  VALUES.  The  qulifier  is  usually  the  part 
of  the  condition  up  to  and  including  the  verb.  The  values  are  the 
possible  completions  of  the  sentence  started  by  the  qualifier.  A 
qualifier  can  have  up  to  30  possible  values.  If  the  developer  finds  that 
he  needs  more  than  30  values,  tries  to  divide  them  into  groups  each 
with  less  than  30  values.  Then  create  a  qualifier  that  selects  among 
the  groups  and  use  it  in  conjunction  with  new  qualifiers  for  each  of  the 
individual  groups.  When  a  new  qualifier  is  created  in  BXSTS  it  is  given 
a  list  of  possible  values  such  that  combining  the  qualifier  with  a  value 
(or  values)  makes  a  sentence.  When  more  than  one  value  is  selected, 
the  program  will  pu  or  between  the  values  and,  if  any  one  of  tho 
listed  values  is  true  the  condition  will  be  true.  A  condition  can  also  be 
formed  by  using  a  qualifier,  "NOT"  and  one  or  more  values.  The 
selection  of  qualifier  and  values  should  be  such  that  combining  them 
with  NOT.  OR  or  AND  makes  a  grammatically  correct  sentence.  For 
mathematical  conditions,  they  are  represented  as  algebraic 
expressions.  The  mathematical  expressions  usually  include 
VARIABLES.  A  variable  is  any  string  of  alphanumeric  characters, 
including  spaces,  enclosed  in  ( ].  The  first  18  characters  are  significant 
but  the  variable  can  be  up  tolOO  characters.  Only  letters,  digits  and 
spaces  can  be  used  in  variable  names.  In  this  case  the  variables  are 
used  to  create  evaluable  algebraic  expressions.  An  evaluable 
expression  can  be  any  algebraic  expression  from  a  single  number  to 
complex  expressions.  The  following  operators  are  recognized: 

*  (multiplication 
/  (division) 

+  (addition) 

-  (subtraction) 

%  (modulus  operator) 

Parentheses  can  be  used  to  group  expressions  in  the  order  of 
calculation  the  developer  desires.  Spaces  can  be  included  between 
operators  to  make  the  formula  easier  to  read.  The  following  functions 
are  supported: 

SIN(  ) 

C0S(  ) 

TAN(  ) 

ASIN(  ) 

AC0S(  ) 
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ATAN( ) 

BXP( ) 

LO G( ) 

ABS(  ) 

SQRT(  ) 

INK  ) 

The  trigonometric  expressions  are  in  radians.  The  log  and  exponential 
functions  are  base  e.  ABS  is  the  absolute  value.  SQRT  is  the  square 
root.  1NT  is  the  integer  part,  with  all  fractions  rounded  down.  The 
functions  evaluate  the  expression  in  parentheses  and  perform  the 
appropriate  function  on  the  result.  The  parenthetical  expression  must 
immediately  follow  the  function  name,  without  a  space.  A  rule  can 
have  up  to  126  conditions  in  each  of  its  IF.  THEN  or  ELSE  part. 

2.  Choice:  Choices  are  all  the  possible  solutions  to  the  problem  among 
which  the  expert  system  will  decide.  The  goal  of  EXSYS  is  to  select  the 
most  likely  choice  based  on  the  data  input,  or  to  provide  a  list  of 
possible  choices  arranged  in  order  of  likelihood.  The  choices  can  be  of 
any  form,  item,  actions,  etc.,  depending  on  what  type  of  expert  system 
is  geing  developed.  EXSYS  will  display  the  text  of  the  choice  followed 
by Probability""  and  a  number.  The  number  indicates  the 
confidence  that  the  choice  is  correct  and  is  0,  1  or  a  ratio  such  as  A/B. 
The  denominator.  B.  indicates  the  maximum  possible  value  (either  10 
or  100)  in  the  crlculational  system  being  used.  The  numerator.  A.  is 
the  probability  value  assigned  to  the  choice.  The  person  generating  the 
knowledge  base  must  select  one  of  three  options  for  how  the  program 
will  use  the  probability  data. 

The  rules  are  automatically  invoked  using  backward  chaining.  EXSYS  also 
supports  forward  chaining.  External  programs  can  be  called  by  EXSYS  for 
data  acquisition  and  calculation  and  data  can  be  passed  back  to  EXSYS  for 
analysis.  This  powerful  feature  enables  it  to  handle  a  wide  range  of 
problems.  The  expert  system  can  directly  receive  data  from  automatic 
testing  equipment,  data  bases,  some  spread  sheets  and  dedicated  programs. 
There  are  two  ways  to  call  external  programs  from  the  expert  system.  The 
simplest  is  used  to  get  data  for  a  single  variable  or  qualifier.  The  second 
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method  is  intended  for  obtaining  data  for  a  number  of  variables  or  qualifiers 
such  as  might  be  done  with  a  data  base,  spread  sheet  or  automatic  test 
equipment.  Both  can  easily  be  used  with  a  vide  range  of  programs  and 
programming  languages,  including  BASIC.  There  must  be  enough  memory 
available  for  both  EXSTS  and  the  program  called  to  run.  EXSTS  remains 
resident  in  memory  while  the  called  program  runs. 

Blackboarding  is  a  powerful  technique  by  which  more  than  one  knowledge 
base  can  share  a  common  body  of  information.  This  can  be  done  in  EXSTS 
by  having  one  one  knowledge  base  write  data  to  a  file  that  is  read  by  other 
knowledge  bases.  In  version  3.1.  special  command  options  have  been  added 
to  make  this  easier  to  do.  There  are  two  basic  ways  of  controlling  the  order 
of  execution  of  the  various  knowledge  bases.  The  first  is  through  the  use  of 
a  batch  file.  This  is  appropriate  if  the  knowledge  bases  are  intended  to  be 
called  in  a  specific,  defined  order.  Batch  files  can  include  loops  to  restart  the 
system.  The  second  technique  is  to  have  an  EXSTS  expert  system  call  other 
EXSYS  expert  system.  This  is  the  most  powerful  technique,  as  the  order  of 
excution  of  the  knowledge  bases  can  be  varied  depending  on  what  is  needed. 
Blackboarding  requires  care  in  making  sure  that  the  knowledge  bases 
sharing  the  information  each  assign  the  data  correctly,  however,  the  benefits 
in  being  able  to  segment  a  problem  and  allowing  extremely  complex  systems 
to  be  run  in  a  PC  make  the  technique  quite  worth  while. 


APEEND1X.E 

SAMPLE  OF  TRAFUSl  PRODUCTION  RULES 


Subject: 

Hard  Decision  Knowledge 'Based  Multitarget  Multisensor  Track  Fusion 


Author : 

Alaa  Eldin  M.  Fahmy 


Starting  text: 

This  Expert  System  is  an  experimental  system  to  explore  the 
applicability  of  artificial  intelligence  techniques  to  the 
implementation  of  an  automated,  extremely  flexible  track  fusion 
consultant.  The  expert  system  fuses  tracks  from  three  different  TVS 
systems,  A.-B.C.  Each  system  has  two  different  tracks,  resulting  in  a 
total  of  6  tracks.  The  tracks  are  A1,A2,B1,B2,C1,C2  from  TVS  sites 
A , B , and  C  respectively.  TVS  sites  are  covering  large  air-space  areas 
with  partially  overlapping  fields  of  view.  Track  fusion  is  based  on 
kinematic,  attribute  and  behavler  Information. 


Ending  text: 

The  expert  system  will  display  which  cracks  are  the  same  and  to. which 
target  they  are  belong.  In  addition  to  whether  targets  are  friendly 
or  hostile  targets. 


Stops  after  first  successful  rule  in  data  derivations. 
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RULES : 


RULE  NUMBER:  1 
IF: 

ABS((RAl]-[RBl])  <-  .15 
and  ABS ( [ PHlAl ] - [ PHIB1 ] )  <-  2 
and  ABS ([SPEED  OF  All -(SPEED  OF  Bl])  <-  20 
and  (IDA11  -  [IDBl] 


THEN  : 

Tracks  Al  and  Bl  are  for  the  sane  cargec  -  Probabllity-1 
and  Targets  al  and  bl  are  one  cargec 


ELSE: 

Track  Al  Is  for  target  al  -  Probability-1 
and  Track  Bl  is  for  target  bl  -  Probability-1 
and  Targecs  al  and  bl  are  different  targets 


RULE  NUMBER:  2 

IF: 

ABS([RA1]-[RB23)  <-  .15 
and  ABS ( [ PHIA1 ] - [ FHIB2  j )  <-2 
and  ABS ([SPEED  OF  Al]- (SPEED  OF  B2])  <-  20 
and  [ IDAl]  -  [IDB2] 


THEN: 

Tracks  Al  and  B2  are  for  the  same  cargec  -  Probability-1 
and  Targets  al  and  b2  are  one  target 


ELSE: 

Track  Al  is  for  target  al  -  Probability-1 
and  Track  B2  is  for  target  b2  -  Probabilicy-1 
and  Targets  al  and  b2  are  different  targets 
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RULE  NUMBER:  8 


IF: 

ABS( [RA2] - [RC2] )  <-  .2 
and  ABS ( [ PHIA2 ] - [ PHIC2 ] )  <-  3 
and  ABS ([SPEED  OF  A2 ] - [SPEED  OF  C2))  <-  30 
and  [IDA2]  -  [IDC2] 


THEN: 

Tracks  A2  and  C2  are  for  the  same  target  -  Probability-1 
and  Targets  a2  and  c2  are  one  target 


ELSE: 

Track  A2  is  for  target  a2  -  Probability-1 
and  Track  C2  is  for  target  c2  -  Probability-1 
and  Targets  a2  and  c2  are  different  targets 


RULE  NUMBER:  9 
IF: 

Tracks  Al  and  B1  are  for  the  same  target 
and  Tracks  Al  and  Cl  are  for  the  same  target 

THEN: 

Tracks  Al,  Bl,  and  Cl  are  for  the  same  target 
and  Targets  al,  bl,  and  cl  are  the  same  and  they  are  one  target 


RULE  NUMBER:  10 
IF: 

Tracks  Al  and  B2  are  for  the  same  target 
and  Tracks  Al  and  Cl  are  for  the  same  target 

THEN: 

Tracks  Al,  B2,  and  Cl  are  for  the  same  target 
and  Targets  al,  b2,  and  cl  are  the  same  and  they  are  one  target 
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RULE  NUMBER:  11 


IF: 

Tracks  Al  and  B1  are  for  the  same  carget 
and  Tracks  Al  and  C2  are  for  che  same  target 

THEN: 

Tracks  Al,  Bl,  and  C2  are  for  the  same  target 
and  Targets  al,  bl,  and  c2  are  the  same  and  they  are  one  target 


RULE  NUMBER:  12 
IF: 

Tracks  Al  and  &2  are  for  che  same  target 
and  Tracks  Al  and  C2  are  for  the  same  target 

THEN: 

Tracks  Al,  B2,  and  C2  are  for  the  same  target 
and  Targets  al,  b2,  and  c2  are  che  same  and  they  are  one  target 


RULE  NUMBER:  13 
IF: 

Tracks  A2  and  Bl  are  for  the  same  target 
and  Tracks  A2  and  Cl  are  for  the  same  carget 

THEN: 

Tracks  A2 ,  Bl,  and  Cl  are  for  che  same  target 
and  Targets  a2,  bl,  and  cl  are  the  same  and  they  are  one  carget 
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RULE  NUMBER:  18 


IF: 

Track  B1  is  for  a  target  not  filed  in  the  flight  plan 
and  Target  of  track  B1  is  not  responding  to  IFF  or  SSR 
and  The  position  of  track  B1  is  outside  all  airliner  airways 
and  Track  B1  is  conning  from  VEST 


THEN: 

Track  B1  is  for  a  hostile  aircraft  -  Probability-1 
and  Alert  friendly  forces  -  Probability-1 


ELSE: 

Track  B1  is  for  a  friendly  aircraft  or  an  airliner  -  Probability-1 


RULE  NUMBER:  19 
IF: 

Track  B2  is  for  an  aircraft  not  filed  in  the  flight  plan 
and  Target  of  track  B2  is  not  responding  to  IFF  or  SSR 
and  The  position  of  track  B2  is  outside  all  airliner  airways 
and  Track  B2  is  conning  from  VEST 


THEN: 

Track  B2  is  for  a  hostile  aircraft  -  Probability-1 
and  Alert  friendly  forces  -  Probability-1 


ELSE: 

Track  B2  is  for  a  friendly  aircraft  or  an  airliner  -  Probability-1 
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RULE  NUMBER:  20 


IF: 

Track  Cl  is  for  a  target  not  filed  in  the  flight  plan 
and  Target  of  track  Cl  is  not  responding  to  IFF  or  SSR 
and  The  position  of  track  Cl  is  outside  all  airliner  airways 
and  Track  Cl  is  comming  from  WEST 


THEN: 

Track  Cl  is  for  a  hostile  aircraft  -  Probabiiity-1 
and  Alert  friendly  forces  -  Probability-1 


ELSE: 

Track  Cl  is  for  a  friendly  aircraft  or  an  airliner  -  Probability-1 


RULE  NUMBER:  21 
IF: 

Track  C2  is  for  a  target  not  filed  in  the  flight  plan 
and  Target,  of  track  C2  is  not  responding  to  IFF  or  SSR 
and  The  position  of  track  C2  is  outside  all  airliner  airways 
and  Track  C2  is  coaming  from  WEST 


THEN: 

Track  C2  is  for  a  hostile  aircraft  -  Probability-1 
and  Alert  friendly  forces  *  Probability-1 


ELTE ; 

Track  C2  is  for  a  friendly  aircraft  or  an  airliner  -  Probability-1 


RULE  NUMBER:  22 
IF: 

ABS( (SPEED  OF  A1 ] )  <-  50 

THEN: 

Track  Ai  is  for  a  HELICOPTER 
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RULE  NUMBER:  27 


IF: 

ABS< (SPEED  OF  C2 ] )  <-  50 

THEN: 

Track  C2  is  for  a  HELICOPTER 


RULE  NUMBER:  23 
IF: 

50  <  ABS(( SPEED  OF  Alj)  <-  200 


THEN: 

Track  A1  is  for  a  CIVIL  A/C 


RULE  NUMBER:  35 


IF: 


200  <  ABS(( SPEED  OF  A2])  <-  400 


THEN: 

Track.  A2  is  for  »  BOMBER 


QUALIFIERS : 


1  Tracks  Al  and  B1  are 
for  che  same  target 

Used  in  rule(s):  9  11 


2  Tracks  Al  and  Cl  are 
for  che  same  target 

Used  in  rule (a):  9  10 


3  Tracks  Al,  Bl,  and  Cl  are 
for  che  same  target 

Used  in  rule(s):  (  9) 


4  Targets  al,  bl,  and  cl  are 
che  same  and  they  are  one  target 

Used  in  rule(s) :  (  9) 


195 


17  Tracks  A2  and  B2  are 
for  the  same  target 


Used  in  mle(s) :  14 


15 


44  Track  A2  is 

for  a  target  not  filed  in  the  flight  plan 
for  a  target  filed  in  the  flight  plan 

Used  in  rule(s):  17 


47  Track  A2  is 

comming  from  WEST 
comming  from  EAST 
comming  from  NORTH 
comming  from  SOUTH 


Used  in  rule (a):  17 


69  Track  Al  is 

for  a  HELICOPTER 
for  an  INTERCEPTOR 
for  a  BOMBER 
for  a  CIVIL  A/C 
for  a  MISSILE 


Used  in  rule(s):  (  22)  (  28)  (  34) 


40) 
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VARIABLES : 


1  R1 

Rl  Is  Che  range  of  crack  Al  ac  Cine  c 
Displayed  ac  Che  end  of  a  run 

Used  in  rule(s): 


2  R5 

R5  is  che  range  of  crack  B1  ac  Cime  t 
Used  in  rule(s): 


3  PHI1 

PHIl  is  che  bearing  of  crack  B1  ac  cine  c 
Used  in  rule(s): 


4  PHI  3 

PHI3  is  che  bearing  of  crack  B1  ac  Cine  c 
Displayed  ac  Che  end  of  a  run 

Used  in  rule(s): 


5  ID1 

ID1  is  Che  identity  of  track  Al 
Displayed  ac  che  end  of  a  run 

Used  in  rule(s) : 


6  ID3 

ID3  is  Che  Identity  of  crack  Bl 
Displayed  ac  che  end  of  a  run 

Used  in  rule(s): 
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FORMULAS : 


1  A£S(R1-R5)  <- 

Used  In  rule(s): 

2  ABS( [Rl] - [R5] )  <-  .15 

Used  in  rule(s) : 

3  ABS( [PHI1] - [ PHI 3 ] )  <-  2 

Used  in  rule(s): 

4  [ID1]  -  [ ID3] 

Used  in  rule(s)  : 

5  ABS( (SPEED  OF  Al]- [SPEED  OF  Bl] ) 

Used  in  rule(s) : 

6  ABS ( [R1 ] - [R7 ) )  <-  .15 

Used  in  rule(s) : 


198 


AEBHSD12LF 

SAMPLE  OF  TRAFUS2  PRODUCTION  RULES 


Subject: 

Soft  Decision  Knowledge- Based  Multitargec  Mulcisensor  Track  Fusion 


Author: 

Alaa  Eldin  M.  Fahmy 


Starting  text: 

This  expert  system  is  the  same  as  TRAFUS1  except  that  the 
probabilistic  reasoning  is  used.  In  this  case  as  a  degree  of  belief 
of  a  rule,  a  probabilty  ratio  (o/lO  -  10/10)  is  attached  to  its  then 
part.  The  expert  system  obtains  the  data  needed  to  make  a  decision  by 
asking  the  user  questions  relevant  to  the  tracks  needed  to  be  fused. 
The  user  can  ask  the  expert  system  how  it  arrived  at  its  final 
decision.  The  user  can  easily  test  and  analyze  the  effect  his  input 
had  on  the  final  outcome.  Also,  the  user  can  ask  why  the  expert 
system  needs  to  know  the  Information  it  is  requesting. 


Ending  text: 

The  expert  system  will  display  which  tracks  are  belonging  to  the 
same  cargot  and  which  are  not.  The  results  will  be  associated  with  a 
probability  ratio  as  a  degree  of  belief  in  it.  If  there  are  more  chan 
one  solution,  they' will  be  displayed  according  to  their  relative 
likelihood.  Whether  the  targets  are  friendly  or  hostile  will  be 
displayed  in  addition  to  its  kind  (helicopter,  fighter,  bomber,  civil 
A/C,  or  missile) 


Uses  all  applicable  rules  in  data  derivations. 


RULES: 


RULE  NUMBER:  1 
IF: 

ASS ( [ RAl ] • [ RBI ] )  <-.2 


THEN: 

Tracks  Al  and  B1  balong  Co  the  same  target  -  Probability-6/10 
and  Targacs  al  and  bl  ara  tha  same  target  and  they  are  one  target  - 
Probabilicy-6/10 


ELSE: 

Track  Al  belongs  to  target  al  -  Probability-7/10 
and  Track  Bl  belongs  to  target  bl  -  Probability-7/10 
and  Tracks  Al  and  Bl  belong  to  different  targets  al  and  bl  - 
Probability-7/10 


NOTE: 

RAl  and  RBI  are  in  ka. 


REFERENCE: 

Col.  Fahmy's  Ph.D.  dissertation,  Sep.  1987. 
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RULE  NUMBER:  5 


IF: 

ABS( [SPEED  OF  Al ] )  <-50 


THEN: 

Target  al  is  a  HELICOPTER  -  Probabilicy-8/10 


RULE  NUMBER:  6 
IF: 

50  <  ABS([ SPEED  OF  Al])  <-  200 


THEN: 

Target  al  is  a  CIVIL  A/C  •  Probability-8/10 


REFERENCE:  * 

Col.  Fahiny's  Ph.D.  dissertation,  Sep.  1987. 


RULE  NUMBER:  7 
IF: 

200  <  ABS( [SPEED  OF  Al])  <-  400 


THEN: 

Target  al  is  a  BOMBER  -  Probability-8/10 


REFERENCE : 

Col.  Fahmy's  Ph.D.  dissertation,  Sep.  1987. 
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RULE  NUMBER:  24 


IF: 

ABS ( [ RAl ] - ( RC1 ] )  <-.2 
and  ABS((PHlAl]-[PHICl])  <-3 


THEN : 

Tracks  Al  and  Cl  belong  to  the  sane  target  -  Probabllity-7/10 
and  Targets  al  and  cl  are  the  same  target  and  they  are  one  target  - 
Probability-7/10 

and  Tracks  Al  and  Cl  belong  to  different  targets  al  and  cl  - 
Probability— 4/10 


ELSE: 

Tracks  Al  and  Cl  belong  to  different  targets  al  and  cl  - 
Probability-7/10 

NOTE: 

Bearing  information  of  TUS  system  C  is  less  accurate  than  that  of  TWS 
systems  A  and  B. 


REFERENCE: 

Col.  Fahmy's  Ph.D.  dissertation,  Sep.  1987. 


RULE  NUMBER:  25 
IF: 

ABS ( [ RAl ] - [ RC1 ] )  <-.2 
and  ABS ( [ PHlAl ] - [ PHIC1 ] )  <-3 
and  ABS ([SPEED  OF  Al]- [SPEED  Or  Cl])  <-25 


THEN: 

Tracks  Al  and  Cl  belong  to  the  same  target  -  Probability-8/10 
and  Targets  al  and  cl  are  the  same  target  and  they  are  one  target  - 
Probability-8/10 

and  Tracks  Al  and  Cl  belong  to  different  targets  al  and  cl  - 
Probability-2/10 

ELSE: 

Tracks  Al  and  Cl  belong  to  different  targets  al  and  cl  - 
Probability-7/10 
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RULE  NUMBER:  53 


S 

IF: 

ABS( (RA2] * {RC1] )  <-.2 
and  ABS([PHIA2]-(PHIC1))  <-3 
and  ABS( [SPEED  OF  A2J* [SPEED  OF  Cl))  <-  25 
and  [IDA2]  -  [IDC1] 


THEN: 

Tracks  A2  and  Cl  belong  co  che  sane  target  -  Probability-9/10 
and  Targets  a2  and  cl  are  the  sane  target  and  they  are  one  target 
Probabilicy-9/10 


ELSE: 

Tracks  A2  and  Cl  belong  to  different  targets  a2  and  cl  - 
Probability-10/10 


RULE  NUMBER:  60 
IF: 

Track  B2  is  for  an  aircraft  not  filed  in  the  flight  plan 
and  Track  B2  is  for  an  aircraft  not  responding  to  IFF  or  SSR 


THEN: 

Track  B2  is  for  a  hostile  aircraft  -  Probability-7/10 


ELSE: 

Track  B2  is  for  a  friendly  aircraft  or  an  airliner  -  Probability-7/10 


REFERENCE : 

Col.  rahmy's  Ph.D.  dissertation,  Sep.  1987. 
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RULE  HUMBER:  68 
IF: 

Track  B1  Is  for  an  aircraft  not  filed  in  the  flight  plan 
and  Track  B1  is  outside  all  airliner  airways 
and  Track  B1  is  outside  all  airliner  airways 


THEN: 

Track  B1  is  for  a  hostile  aircraft  *  Probability-8/10 


RULE  NUMBER:  69 


IF: 

Track  B1  is  for  an  aircraft  not  filed  in  the  flight  plan 
and  Track  B1  is  outside  all  airliner  airways 
and  Track  B1  is  outside  all  airliner  airways 
and  Track  B1  is  conning  fron  WEST 


THEN: 

Track  B1  is  for  a  hostile  aircraft  ♦  Probability-9/10 


RULE  NUMBER:  70 
IF: 

Track  B2  is  for  an  aircraft  not  filed  in  the  flight  plan 
and  Track  B2  is  for  an  aircraft  not  responding  to  IFF  or  SSR 

THEN: 

Track  B2  is  for  a  hostile  aircraft  •  Probability-7/10 
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RULE  NUMBER:  88 


IF: 

Track  B1  is  for  an  aircraft  responding  to  IFF  or  SSR 

THEN: 

Track  B1  is  for  a  friendly  aircraft  or  an  airliner  -  Probability-8/10 


RULE  NUMBER:  89 
IF: 

Track  B1  is  inside  any  of  airliner  airways 

THEN: 

Track  B1  is  for  a  friendly  aircraft  or  an  airliner  -  Probability-6/10 


RULE  NUMBER:  .90 
IF: 

Track  B1  is  coaming  from  EAST  or  coaming  from  NORTH  or  coaming  from 
SOUTH 

THEN: 

Track  B1  is  fcr  a  friendly  aircraft  or  an  airliner  -  Probabllity-9/10 


RULE  NUMBER:  91 
IF: 

Track  B2  is  for  an  aircraft  filed  in  the  flight  plan 

THEN: 

Track  B2  is  for  a  friendly  aircraft  or  an  airliner  •  Probabllity-9/10 
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QUALIFIERS : 


Track  A1  is 

for  an  aircraft  filad  in  tha  flight  plan 

for  an  aircraft  not  filad  in  tha  flight  plan 

for  an  aircraft  responding  to  IFF  or  SSR 

for  an  aircraft  not  responding  to  IFF  or  SSR 

outside  all  airliner  airways 

inside  any  of  airliner  airways 

coaming  from  EAST 

coaming  from  WEST 

comming  from  NORTH 

coaming  from  SOUTH 


Used  in  rule(s): 


'■  Track  A2  is 

for  on  olrcroft  fllod  In  tho  flight  plon 
for  on  olrcroft  not  fllod  In  th.  flight  plon 
for  an  aircraft  responding  to  IFF  or  SSR 

fH  r*fc,  responding  to  IFF  or  SSR 
airliner  airways 
inside  any  of  airliner  airways 
coaming  from  EAST 
coaming  from  WEST 
coaming  from  NORTH 
coaming  from  SOUTH 


Used  in  rule(s); 
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CHOICES : 


1  Track  Al  belongs  to  target  al 

Used  in  rule(s) :  [  1]  [  2] 

2  Track  A2  belongs  to  target  a2 

Used  in  rule(s): 

3  Track  81  belongs  to  target  bl 

Used  in  rule(s):  [  1)  [  ?] 

»  Track.  B2  belongs  to  target  b2 
Used  in  rule(s): 

Track  Cl  belongs  to  target  cl 
Used  In  nile(s): 

Track  C2  belongs  to  target  c2 
Used  in  rule(s): 
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VARIABLES : 


1  RAl 

Range  of  last  report  of  track  Al 
Numeric  variable 
Displayed  at  the  end  of  a  run 

Used  In  rule(s):  1  2 

12  13 

33  34 


3  4  10  11 

24  25  26  32 


2  RBI 

Range  of  last  report  of  track  B1 
Numeric  variable 
Displayed  at  the  end  of  a  run 

Used  in  rule's):  1  2  3  4  4  41 

42 


3  PHIA1 

Bearing  of  last  report  of  track  Al  in  degrees 
Numeric  variable 
Displayed  at  the  end  of  a  run 

Used  in  rule(s):  2  34  11  12  13 

24  25  26  32  33  34 


4  PHIB1 

Bearing  of  last  report  of  track  B1  in  degrees 
Numeric  variable 
Displayed  at  the  end  of  a  run 

Used  in  rule(s):  2  3  4  40  41  42 
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12  IDB2 

Identity  code  of  track  B2 
String  variable 
Displayed  at  the  end  of  a  run 

Used  in  rule(s):  13  45 


13  RC1 

Range  of  last  report  of  track  Cl  in  ka 
Numeric  variable 
Displayed  at  the  end  of  a  run 

Used  in  rule(s):  24  25  26  51  52  53 


14  PHIC1 

Bearing  of  last  report  of  crack  Cl  in  degrees 
Numeric  variable 
Displayed  at  the  end  of  a  run 

Used  in  rule(s):  24  25  26  51  52  53 


15  SPEED  OF  Cl 

Speed  of  crack  Cl  in  a/s 
Numeric  variable 
Displayed  at  Che  end  of  a  run 

Used  in  ruls(s):  25  26  27  28  29  30 

31  52  53 


16  IDC1 

Identity  code  of  track  Cl 
String  variable 
Displayed  at  the  end  of  a  run 

Used  in  rule(s):  26  53 
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